Learning Relaying Strategies in Cellular D2D Networks with Token-Based Incentives

Size: px
Start display at page:

Download "Learning Relaying Strategies in Cellular D2D Networks with Token-Based Incentives"

Transcription

1 Learning Relaying Sraegies in Cellular D2D Neworks wih Token-Based Incenives Nicholas Masronarde, Viral Pael Deparmen of Elecrical Engineering Sae Universiy of New York a Buffalo Buffalo, NY USA Jie Xu, Mihaela van der Schaar Deparmen of Elecrical Engineering Universiy of California a Los Angeles Los Angeles, CA USA Absrac We consider a cellular nework where inelligen cellular devices owned by selfish users are incenivized o cooperae wih each oher by using okens, which hey exchange elecronically o buy and sell downlink relay services, hereby increasing he nework s capaciy. We endow each device wih he abiliy o learn is opimal cooperaion sraegy online in order o maximize is long-erm uiliy in he dynamic nework environmen. We invesigae he impac of he oken exchange sysem on he overall downlink nework performance and he performance of individual devices in various deploymen scenarios involving mixures of high and low mobiliy users. Our resuls sugges ha devices have he greaes incenive o cooperae when he nework conains many highly mobile users (e.g., users in moor vehicles). Moreover, wihin he oken sysem, devices can effecively learn o cooperae online, and achieve over 20% higher hroughpu on average han wih direc ransmission alone, all while selfishly maximizing heir own uiliy. Keywords D2D cooperaive relaying; oken exchange sysem; online learning; LTE-Advanced I. INTRODUCTION Cooperaive device-o-device (D2D) relaying can increase he capaciy of cellular neworks [1] and has he poenial o significanly improve he performance of imporan applicaions such as muliuser wireless video sreaming [2][3]. For his reason, cooperaive communicaion and D2D echnologies are cenral o he LTE-Advanced sandard, which aims o address an anicipaed exponenial increase in mobile daa over he nex few years. However, mos exising work on cooperaive ransmission assumes ha all nework devices will ac as relays whenever requesed. Unforunaely, his assumpion does no hold when devices are cellphones/lapops/ables ha are owned by self-ineresed users, which aim o maximize heir own uiliy. In his siuaion, devices may no be willing o use heir limied baery energy o relay oher devices daa wihou any direc benefi o hemselves. Therefore, o simulae cooperaion among selfineresed devices, and realize he full poenial of cooperaive relaying, proper incenives mus be designed ino he sysem. One way o incenivize cooperaion is o inroduce a paymen o reward he self-ineresed devices when hey ac as relays and inroduce a charge o he devices ha receive help from relays. To his end, elecronic okens have been proposed [4][5]. The key idea is ha, each ime a device receives help from a relay, i pays he relay wih a oken, which he relay can use o ge relay service in he fuure. As long as he expeced fuure benefi of having an addiional oken ouweighs he immediae cos of relaying, hen a self-ineresed device will be willing o ac as a relay [5]. In his paper, we adop a oken-based approach similar o [5]. However, unlike in [5], where he focus is on designing oken-based incenive schemes from a sysem perspecive, in his paper, we invesigae how he devices can learn o adap heir wireless cooperaion and oken gahering sraegies online, a run-ime, based on heir experience. Specifically, we sudy how an individual user in he sysem can learn is opimal (bes response) cooperaion sraegy online, and how his sraegy impacs and is impaced by he oher heerogeneous devices in he nework, which are simulaneously learning. There is a lo of grea lieraure on cooperaive relaying in wireless and cellular neworks (e.g., [1]-[3][6][7][10][12]). However, hese works do no consider selfish users ha require incenives o cooperae, which is he focus of his paper. Imporanly, many of he echniques and soluions in he exising lieraure on cooperaive relaying can be implemened in conuncion wih our proposed framework. Our conribuions are as follows: We model each device s decision problem, i.e., wheher or no o relay, as a Markov decision process (MDP). The obecive of a device is o maximize is long-erm uiliy, which is defined as he difference beween he benefi i gains when i receives daa hrough a relay and he energy cos i incurs when i relays daa for anoher device. We propose a low complexiy learning algorihm ha enables each individual device o opimize is long-erm uiliy in he dynamic nework environmen. We sysemaically evaluae he sysem performance in various deploymen scenarios involving boh high mobiliy and low mobiliy users. Our simulaion resuls sugges ha devices have he greaes incenive o cooperae when he nework conains many highly mobile users (e.g., users in moor vehicles). The remainder of his paper is organized as follows. In Secion II, we presen he sysem model and formulae an individual device s opimizaion problem. In Secion III, we propose a simple and effecive algorihm ha an individual device can deploy o learn he opimal cooperaion sraegy online. In Secion IV, we presen our simulaion resuls. We conclude in Secion V /13/$ IEEE 163 1

2 II. SYSTEM MODEL A. Cooperaive Downlink Nework Model We consider a cellular nework comprising N mobile ransceiver devices. 1 We assume ha ime is sloed ino discree ime inervals indexed by N. A any given ime, a fracion of he devices need o receive daa from he base saion; however, some of hese devices may experience bad channel condiions (e.g., due o fading or shadowing), which will limi heir received daa rae. In his siuaion, inermediae devices can ac as device-o-device relays o help deliver he daa from he base saion o he desinaion device. We use an Amplify-and-Forward (AF) cooperaion scheme [12] and a simple algorihm for deciding when a device will reques a relay and which relay i will selec. Imporanly, he oken sysem and learning soluion proposed in his paper are no dependen on hese specific deails: in fac, hey can be applied o Decode-and-Forward (DF) cooperaion schemes and any oher relay selecion sraegies ha use one relay a a ime for each cooperaive link, e.g., [6][7]. 2 Oubound relay demand rae: Suppose ha in slo, device {1,, N} wans o receive daa from he base saion (device 0). We assume ha device has a arge,arge receive rae r ha is achievable a a arge Signal-o,arge Inerference-and-Noise Raio (SINR) Γ. Assuming ha 0 he source power is fixed a P, here is a non-zero probabiliy λ ha he device will no achieve is arge rae in slo : i.e., he ouage probabiliy 0,arge 0,arge Pr( r r Pr( λ = < ) = Γ <Γ ), (1) 0 where Γ is he SINR of he channel beween he base saion 0 and device, and r is he corresponding receive rae for device. If device is unable o receive is arge rae during slo, hen i will aemp o find a relay hrough which i can achieve is arge rae. For his reason, we refer o λ as device s insananeous oubound relay demand rae (ORDR). Noe ha a device s insananeous ORDR is unknown a priori because i depends on he device s geographic locaion, is disance from he neares base saion, and he underlying nework condiions. Moreover, since devices are mobile, heir insananeous ORDRs are ime-varying. For convenience, in he remainder of his secion, we model device s ORDR as a fixed value λ ; however, in Secion III, we assume ha i is ime-varying and propose a learning algorihm ha enables he device o dynamically adap as is ORDR changes over ime. Illusraive relay selecion sraegy: If device is unable o receive is arge rae during slo, hen i aemps o find a relay hrough which i can achieve is arge rae. Following sandard relay channel analysis for AF cooperaion [12], we obain he following received SINR over he cooperaive link: 1 The proposed soluion can also be applied o an ad-hoc nework where here is no base saion. 2 The sysem can be exended o work wih cooperaion schemes ha use muliple relays, e.g., hose based on randomized space-ime block coding [10]. 0i i 0i ΓΓ 0i i 1 Γ = Γ +Γ +, (2) where i {1,, N} denoes he relay device and Γ and are he SINRs of he source-relay and relay-desinaion channels, respecively. Assuming ha he relay ransmission i power is P, device selecs he relay i ha can mee he arge SINR while using he leas power: i.e., i i cell( ) i 0i,arg 0i i Γ = argmin { P : Γ Γ }, (3) where cell( ) is he se of devices ha are wihin range of he same base saion as device. If here is no relay for which 0i,arg Γ Γ ha is also willing o provide relay services, hen device will receive is daa direcly from he base 0, arg saion a he rae r < r. B. Token Sysem Providing a relay ransmission coss he relay energy wihou providing i any immediae benefi. Consequenly, wihou proper incenives, no ransceiver will be willing o cooperae. To overcome his, we incenivize devices o cooperae hrough he use of okens, which hey exchange elecronically o buy and sell relay services. Therefore, in order for a device o reques relay service, i mus have a oken available o buy he service from anoher device. Afer a device places a relay reques, a relay replies wih eiher a posiive or negaive acknowledgemen (ACK/NACK) [5] indicaing if i is willing o relay or no (we describe how his decision is made in Secion II.D). If he device s reques is ACK d, hen a oken ransfer akes place (from he requesing device o he relaying device) and relay ransmission occurs. Several echniques ha enable secure elecronic oken ransacions have been proposed [4][11], which require no cenral eniy. Our work assumes ha such echnology is in place o implemen he proposed oken exchange sysem. C. Model of a Nework Device We now describe our model of device {1,, N}. Token holding sae: A a given ime, device holds k K = {0,1,..., T} okens, where T is he oal number of okens in he nework. Baery sae/energy budge: Mobile devices have limied baery energy o suppor heir operaions. We le p max represen he energy ha he user allocaes o relaying daa for oher users. p max can be se based on user preferences. Wih each relay ransmission ha device provides, i expends some energy and herefore reduces is energy budge p [0, pmax ]. When is energy budge reaches 0, he device can neiher relay nor ask for relay service. We refer o p = 0 as he dead sae. In our problem formulaion, we assume ha p is coninuous; however, we quanize p in our simulaion resuls o reduce he implemenaion complexiy. Noe ha he budge is for energy spen relaying; hence, i is no decreased when ransmiing or receiving he device s own daa

3 Oubound Relay Reques Inbound Relay Reques Receiver Candidae Relay Oubound Requesλ Device (Receiver) Inbound Reques µ ACK σ ( k, p ) (a) ACK e (c) Device (Candidae Relay) Receiver Relay Device (Receiver) ( k, p ) ( k 1, p) Device (Relay) ( k, p ) ( k + 1, p c) (b) (d) Fig. 1. Illusraion of oubound and inbound relay requess in cellular nework wih cooperaive downlink ransmission. (a) Device requess a relay wih probabiliy λ and is reques is ACK d wih probabiliy e. (b) Afer is reques is ACK d, device ges help from he relay in exchange for one oken. (c) Device receives a relay reques wih probabiliy µ and ACK s he reques based on is cooperaion sraegy σ ( k, p ). (d) Afer i ACK s he reques, device acs as a relay and expends c unis of energy in exchange for one oken. Oubound relay demand rae (ORDR): λ denoes he h device s ORDR as defined in (1). I should be inerpreed as he probabiliy ha device wans o use a relay o receive is daa from he base saion (i is no he probabiliy ha he device acually ges help from a relay). A device s ORDR is unknown a priori for he reasons described in Secion II.A. The definiion of he ORDR is illusraed in Fig. 1(a). Inbound relay demand rae (IRDR): Le µ denoe he probabiliy ha device is asked o relay daa for anoher device. A device s IRDR is unknown a priori because i depends on he device s geographic locaion, is disance from he neares base saion, he locaions of oher devices in he nework, and he underlying nework condiions. The definiion of he IRDR is illusraed in Fig. 1(c). Relay recruimen efficiency: Le e denoe he h device s relay recruimen efficiency, which is defined as he following condiional probabiliy: e 0,arg k > p = Pr(ACK recv'd Γ <Γ, 0, > 0). (4) In words, e is he condiional probabiliy ha device ges help from a relay (i.e., receives an ACK), given ha i requires a relay ransmission, has enough okens o pay a relay, and has non-zero baery energy. I follows ha, if k > 0 and p > 0, hen he probabiliy ha device acually receives an ACK from a relay in a ime slo can be wrien as λ e. A device s relay recruimen efficiency is unknown a priori because i depends on he device s geographic locaion, is disance from he neares base saion, he locaions of oher devices in he nework, he underlying nework condiions, and he cooperaion sraegies, oken holdings, and baery saes of he oher devices. The definiion of he relay recruimen efficiency is illusraed in Fig. 1(a). Cos and benefi: Le b ( r ) be he benefi gained by device when i receives daa hrough a relay a daa rae r and le c be he cos incurred by device (i.e., energy cos) when i provides a relay ransmission o anoher device. Le b =E br ( ) be he expeced benefi over all possible ransmission raes. We normalize he coss o his expeced benefi (i.e., by dividing he cos by b ). Cooperaion acions: When he device receives a relay reques, i mus decide if i will ACK he reques. The acion se ( p A ) depends on he device s baery sae p : i.e., A ( p ) = {0,1} if p > 0 and A ( p ) = {0} if p = 0, where a = 1 A ( p ) (ACK) means ha he device is willing o ac as a relay and a = 0 A ( p ) (NACK) means ha i is no. 3 If he device is in he dead sae (i.e., p = 0 ), hen i canno ac as a relay (i.e., A (0) = {0} ). Cooperaion sraegy/policy: Le σ ( k, p ) A ( p ) denoe he device s cooperaion sraegy when i has k okens and is in power saep. Given is IRDR and cooperaion sraegy, he probabiliy ha device acually relays daa for anoher device in a ime slo is µσ ( k, p ). The role of he cooperaion sraegy is illusraed in Fig. 1(c). Sae evoluion: Denoe he device s sae by s = ( k, p ) S, where k K = {0,1,..., T} is is oken holding sae and p [0, p ] is is baery sae/energy max 3 More precisely, A ( p ) = {0,1} if p > c, where c is he amoun of energy required o relay. However, o simplify he model descripion, we do no explicily wrie his. This mismach beween model and realiy is largely insignifican because i only changes he behavior of he sysem in he firs ime slo when p < c

4 budge. When he device acs as a relay, i gains one oken and loses c unis of baery energy: i.e., ( k, p ) ( k + 1, p c ). When he device uses a relay, i loses one oken and remains in he same baery sae: ( k, p ) ( k 1, p ). The evoluion of he sae is illusraed in Fig. 1(b,d). Le ([, P k p ] [ k, p ], a ) denoe he sae ransiion probabiliy funcion, which gives he probabiliy of ransiioning from sae s = ( k, p ) o sae s = ( k, p ) afer aking cooperaion decision a. The sae ransiion probabiliy funcion is defined as follows: P ([0, p ] [0, p ], a ) = 1 µ a, P ([1, p c ] [0, p ], a ) = µ a, P ([ k 1, p ] [ k, p ], a ) = λe, P ([ k, p ] [ k, p ], a ) = 1 λ e µ a, P ([ k + 1, p c ] [ k, p ], a ) = µ a, P ([ k,0] [ k,0], a ) = 1, where a A ( p ) and ([, Ρ k p ] [ k, p ], a ) = 0 under all cases ha are no included in (5). The firs and second lines of (5) indicae ha if device has zero okens and non-zero baery energy, hen i remains in ha sae wih probabiliy 1 µa (i.e., i does no provide help as a relay) or gains one oken and loses c unis of baery energy wih probabiliy µ a (i.e., i provides help as a relay), respecively. The hird, fourh, and fifh lines of (5) indicae ha if device has nonzero okens and non-zero baery energy, hen i ges help in exchange for a oken wih probabiliy λ e, remains in he same sae wih probabiliy 1 λ e µ a (i.e., i neiher ges help nor provides help), and gains one oken and loses c unis of baery energy wih probabiliy µ a (i.e., i provides help), respecively. The final line of (5) indicaes ha device canno gain or lose okens if is baery is dead (i.e., p = 0 ). Expeced uiliy: Le u ( k, p, a ) denoe he expeced uiliy of being in sae s = ( k, p ) and aking cooperaion acion a : specifically, µ ac, ifk = 0, p > 0 u ( k, p, a ) = λ eb µ ac, if k > 0, p > 0 (6) 0, oherwise. In (6), when a device has non-zero okens (i.e., k > 0 ), i eiher ges help from a relay wih probabiliy λ e and receives benefi b or provides relay service wih probabiliy µ a and incurs cos c. If he device does no have any okens (i.e., k = 0 ), i canno reques help bu can provide help; herefore, i incurs cos c wih probabiliyµ a. If he device is in a dead sae (i.e., p = 0 ), i canno seek or provide help. (5) D. Opimal Collaboraion Sraegy for a Single Device We formulae he problem of deermining a device s opimal cooperaion sraegy as an MDP [8]. The opimal sae value funcion V ( k, p ) represens how good i is o be in each sae. I is given by he following Bellman equaion: { (,, ) β ( k, )] } ( p ) V ( k, p ) = max u k p a + E [ V p a A, (7) Where E [ ] denoes an expecaion over he nex sae disribuion defined in (5) and β is an economic discoun facor reflecing he fac ha having a oken now is more valuable han having one in he fuure. The key idea in (7) is ha he opimal cooperaion decision in sae s = ( k, p ) no only depends on he immediae uiliy, bu also on he expeced fuure uiliy. Indeed, a long-erm opimizaion is required because, if he device only considers is immediae uiliy, i will never choose o ac as a relay (because i will incur a cos c wihou any direc benefi o iself). The opimal cooperaion sraegy/policy σ ( k, p ) can be deermined by aking he argumen ha maximizes (7). Assuming ha he ransiion probabiliy funcion and uiliy funcion are known a priori, he opimal value funcion can be compued using he well-known value ieraion algorihm [8]. In pracice, however, he cos and ransiion probabiliy funcions are unknown a priori (due o he fac ha he ORDR λ, IRDR µ, and relay recruimen efficiency e are unknown), so device canno direcly apply value ieraion o find he opimal policy. Insead, i mus learn is opimal cooperaion sraegy online based on experience. We discuss our proposed learning soluion in Secion III. E. Balance of Oubound and Inbound Token Exchanges On average, a device will only be able o ge relay service as ofen as i provides relay service because i mus earn as many okens as i spends: specifically, he following condiion mus hold Pr(Device receives inbound ACK [-1 Token]) (8) = Pr(Device sends oubound ACK [+1 Token]) In his secion, we characerize his balance afer making several simplifying assumpions. Firs, we are only ineresed in he balance of oken exchanges while he device has non-zero baery energy, so we assume ha he power sae is non-zero and fixed. Second, since he ORDR λ, IRDR µ, and relay recruimen efficiency e are ime-varying, i is difficul o characerize he seady sae behavior; hence, we assume ha hese parameers are fixed. Under hese assumpions, (8) is equivalen o: λ e Pr( k > 0) = µ E[ σ ( k, p )], (9) 1 Token + 1 Token where he lef hand side is he probabiliy ha device uses a relay (and herefore spends one oken), he righ hand side is 166 4

5 he probabiliy ha i acs as a relay (and herefore earns one oken), Pr( k > 0) is he probabiliy ha i has a leas one oken, and E [ σ ( k, p )] is he probabiliy ha i ACK s an k inbound relay reques, where he expecaion is aken over is seady-sae oken holding disribuion. F. Threshold Sraegies Device s cooperaion decision depends on is oken and baery saes. The opimal cooperaion sraegy σ ( k, p ) is hreshold in he oken sae k and he hreshold depends on he baery sae p : specifically, 1, if k Kh( p ), σ ( k, p ) = (10) 0, oherwise, where he hreshold Kh ( p ) depends on he baery sae, and Kh ( p ) is non-decreasing in p. In oher words, as he device s baery drains, i will wan o conserve is energy and will herefore use a lower hreshold. A proof of his resul is lef as fuure work; however, i has been shown in [5] ha policies are hreshold when here is unlimied baery energy. A represenaive opimal cooperaion policy is illusraed in Fig. 2. For compuaional reasons (i.e., o compue he opimal policy wih value ieraion), we quanize he baery sae ino eleven bins (one represening he dead sae) and we limi he maximum number of okens ha he device can hold o 20. III. LEARNING THE OPTIMAL POLICY σ generaed using Fig. 2. Illusraive opimal cooperaion policy ( kp, ) β= 0.99, λ e= 0.45, µ= 0.45, and c / b= All devices in he cellular nework experience differen environmenal dynamics depending on heir geographic locaion in he nework, heir disance from he neares base saion, he underlying nework condiions, and, imporanly, he locaions, cooperaion sraegies, oken holdings, and power saes of he oher nework devices. These dynamics are capured by he a priori unknown and ime-varying ORDR λ, IRDR µ, and relay recruimen efficiency e, for all {1,, N}, as defined in Secion II.C. In his secion, we propose a hybrid offline/online learning algorihm ha a device can deploy o learn he opimal collaboraion sraegy σ ( k, p ) on-he-fly, despie hese parameers being unknown and ime-varying. In he online phase of he proposed algorihm, device esimaes several unknown parameers (namely, is ORDR λ, IRDR µ, relay recruimen efficiency e, and ransmission cos c ). Then, in each ime slo, device selecs is cooperaion sraegy based on he esimaed values. In principle, device could use hese esimaed values o populae he uiliy and ransiion probabiliy funcions defined in (6) and (5), respecively, and hen use value ieraion o compue he opimal cooperaion sraegy corresponding o he esimaed parameers; however, recompuing he opimal policy in every ime slo is compuaionally prohibiive. For his reason, we propose o firs compue a collecion of cooperaion policies offline, which correspond o a represenaive se of discreized nework parameers, and hen use a simple online able look-up o selec device s cooperaion sraegy in each ime slo. To reduce he size of he look-up able, insead of esimaing he ORDR λ and relay recruimen efficiency e independenly, we direcly esimae he oubound relay success rae π = λe [0,1 / 2]. 4 Addiionally, since he opimal policy is hreshold in he oken sae as described in Secion II.F, each policy can be represened compacly wih only one (hreshold) value per baery sae. We now describe he offline and online phases in more deail. 2 Offline phase: Le 1 X Π= { π, π,, π }, Μ= { µ, µ, µ }, and C = { c c,, c } be finie ses conaining represenaive values of he oubound relay success rae, IRDR, and relay cos, respecively. In he offline phase of he hybrid learning algorihm, we compue he collecion of, Y, Z x y z x y z policies { σ( kp, π, µ, c ) : π, µ, c } x y z Π Μ C. To compue he policy σ( kp, π, µ, c ), we firs populae he uiliy and ransiion probabiliy funcions defined in (6) and (5), respecively, wih x π, y µ, and z c (in place of π, µ, and c ). Subsequenly, we use value ieraion o compue he opimal cooperaion sraegy corresponding o he represenaive parameers. Online phase: In he algorihm s online porion, he device mainains run-ime esimaes of he oubound relay success rae π, denoed by π ˆ, and he IRDR µ, denoed by µ ˆ. These esimaes can be made using an exponenial average of successful relay ransmissions and incoming relay requess, respecively. Afer evaluaing he cos c required o provide a 4 The maximum oubound relay success rae is ½. This is because, on average, a device can only use a relay as ofen as i relays

6 relay ransmission in slo, he device follows he policy σ ( k, p f( π ˆ ), ( ˆ ), ( z gµ hc )), where f : [0,1 / 2] Π, g : [0,1] Μ, and h : [0,1] C map π ˆ, µ ˆ, and c o he neares values in Π, Μ, and C, respecively. IV. SIMULATION RESULTS In his secion, we presen our simulaion resuls. In Secion IV.A, we describe he simulaion seup. In Secion IV.B, we invesigae how, wihin he oken sysem, users mobiliy and baery saes impac heir own performance, he performance of oher users, and he overall nework performance. A. Cellular Nework Simulaion Seup We assume ha N = 1500 mobile ransceivers are uniformly and randomly disribued in a 10 km x 10 km square area consising of 100 cells wih size 1 km x 1 km. There is one base saion a he cener of each cell. In each ime slo, each ransceiver moves o a nearby locaion according o a random waypoin mobiliy model and needs o receive daa from he base saion in he corresponding cell wih probabiliy 0.3. We consider pah loss and shadow fading for he channel model such ha [9], P = P PL( d ) 10 αlog( d / d ) χ, (11) where rx x 0 0 P rx and P x are he receive and ransmi powers (in db), respecively, PLd ( 0) is he pah loss of he reference disance d 0, d is he disance beween he source and desinaion, α is he pah loss facor, and χ is a Gaussian disribued random variable represening he effec of shadow fading. We assume ha he maximum ransmission power of a ransceiver is 15 dbm, he channel bandwidh is 10 MHz, and he arge daa rae is 10 Mbps for all devices. If he arge daa rae canno be achieved wih he maximum power, hen he device requess a relay ransmission. Using he above parameer values, he average ORDR hroughou he nework is approximaely λ= 0.1. The required relay ransmission power is calculaed as he minimum power in he opimizaion defined in (3). The (normalized) cos c is he raio of he relay ransmission power (in mw) o he expeced benefi E br ( ) of achieving rae r. All devices adap heir cooperaion sraegy using he hybrid offline/online learning algorihm. Finally, we assume ha here are a oal of T= 9000 okens in he nework ha are uniformly and randomly disribued among he users a he sar of he simulaion. B. Impac of Mobiliy In his secion, we sudy how each user s mobiliy impacs is own performance, he performance of oher users, and he overall nework performance. We use he simulaion es-bed described in Secion IV.A and assume ha all users deploy he hybrid offline/online learning algorihm wih x π {0.05, 0.1,, 0.45} =Π, y µ {0.05, 0.1,, 0.45} =Μ, and c {0.05, 0.10,, 0.95} =C o compue he collecion of policies offline.. We consider wo classes of users: 1. High mobiliy users move a speeds beween 50 and 120 km/hour. For insance, users in moor vehicles on roads or highways are considered high mobiliy users. These users play differen roles in he nework over ime because someimes hey will be far from he base saion and have a high ORDR, and someimes hey will be closer o he base saion where hey will have a high IRDR. 2. Low mobiliy users move a speeds beween 0 and 8 km per hour. For example, users in offices, resaurans, or on foo are considered o be low mobiliy users. Due o heir limied mobiliy, hese users ypically do no swich roles over ime and heir ORDR and IRDR are relaively saic. Since cellular neworks usually conain boh high mobiliy and low mobiliy users, we now invesigae how differen mobiliy mixures impac he nework performance. We consider five mixures in which he nework comprises 10%, 30%, 50%, 70%, and 90% high mobiliy users, and he remaining users have low mobiliy. In oal, here are N = 1500 users. We assume ha all users sar in he maximum baery sae, which is defined such ha each user can ACK an average of 1000 relay requess before enering he dead sae. In Fig. 3, we plo he average ORDR λ, average IRDR µ, average modified relay recruimen efficiency e Pr( k> 0, p> 0) 5, and average hroughpu gain 6 over 3000 ime slos for users in each mobiliy class and for all users combined. The lower and upper error bars in Fig. 3 indicae he 25 h and 75 h perceniles, respecively, over he class of users. ORDR and IRDR: Fig. 3(a) and Fig. 3(b) illusrae he average ORDR and average IRDR, respecively, under he various user mobiliy mixures. Recall ha, a he beginning of he simulaion, users are uniformly disribued hroughou he nework. Since he low mobiliy users do no significanly deviae from heir saring posiions, and he high mobiliy users move many places hroughou he nework, he average ORDR and IRDR for users in each class is approximaely he same; however, he variaion of ORDRs and IRDRs across users wihin each class varies significanly. In paricular, he low mobiliy users have a much larger range of ORDRs and IRDRs because if hey are a he periphery of a cell, hen hey will have large ORDRs and small IRDRs, and if hey are in he core of a cell, hen hey will have small ORDRs and large IRDRs. In conras, he high mobiliy users move beween he peripheries and cores of various cells over ime, and herefore experience less deviaion in hese parameers across he populaion. Modified relay recruimen efficiency and ACK rae: Fig. 3(c) illusraes he average modified relay recruimen efficiency under he various user mobiliy mixures. The device s modified relay recruimen efficiency 5 The device s modified relay recruimen efficiency is he probabiliy ha is oubound requess are ACK d given ha a relay is required [see (4)]. 6 The hroughpu gain is he raio of he acual hroughpu o he direc ransmission hroughpu. Since he acual hroughpu is always greaer han or equal o he direc hroughpu, he minimum hroughpu gain is

7 e Pr( k> 0, p> 0) is he condiional probabiliy ha is oubound requess are ACK d given ha a relay is required [see (4)]. Imporanly, his parameer is inimaely ied o each user s ORDR and IRDR. On average, low mobiliy users have lower modified relay recruimen efficiencies and more variaion of hese parameers across he populaion. These effecs emerge because low mobiliy users have srongly imbalanced ORDRs and IRDRs unless hey are lucky enough o be siuaed beween he core and periphery of a cell. If a device s ORDR is larger han is IRDR, i.e., λ > µ, hen i ends o run ou of okens (i.e., Pr( k > 0) in (9) will be small). Wihou okens, i canno recrui relays, which reduces is modified relay recruimen efficiency. Noe ha his also hurs oher users because i reduces opporuniies o earn okens. On he oher hand, if a device s ORDR is smaller han is IRDR, i.e., λ < µ, hen i will end o collec a surplus of okens and no have any incenive o ACK incoming relay requess, which in urn reduces he relay recruimen efficiencies of he oher devices in he nework. Noe ha he ACK rae, i.e., he probabiliy ha a device ACKs an inbound reques [he expecaion E [ σ ( k, p )] in (9)], is direcly proporional o he modified relay recruimen efficiency as shown in (9); consequenly, we omi i from Fig. 3. Throughpu gain relaive o direc ransmission: Fig. 3(d) illusraes he average hroughpu gain under he various user mobiliy mixures. Clearly, having a higher fracion of high mobiliy users in he nework improves he average nework hroughpu (relaive o direc ransmission only) because hese users play differen roles in he nework over ime, which enables a coninuous exchange of okens beween devices in he periphery and core of each cell. [2] O. Alay, P. Liu, Y. Wang, E. Erkip, and S. S. Panwar, Cooperaive layered video mulicas using randomized disribued space ime codes, IEEE Trans. on Mulimedia, vol. 13, no. 5, pp , Oc [3] N. Masronarde, F. Verde, D. Darsena, A. Scaglione, and M. van der Schaar, Transmiing imporan bis and sailing high radio waves: a decenralized cross-layer approach o cooperaive video ransmission, IEEE J. on Selec. Areas in Comm. Cooperaive Neworking Challenges and Applicaions, vol. 30, no. 9, pp , Oc [4] L. Buyan and J.-P. Hubaux, Nugles: a virual currency o simulae cooperaion in self-organized mobile ad hoc neworks, EPFL echinical repor, Jan [5] J. Xu and M. van der Schaar, Token sysem design for auonomic wireless relay neworks, IEEE Trans on Communicaions, o appear, [6] A. S. Ibrahim, A. K. Sadek, W. Su, and K. J. R. Liu, Cooperaive communicaions wih relay-selecion: when o cooperae and whom o cooperae wih?, IEEE Trans. on Wireless Communicaions, vol. 7, no. 7, pp , July [7] R. Madan, N. Meha, A. Molisch, and J. Zhang, Energy-efficien cooperaive relaying over fading channels wih simple relay selecion, IEEE Trans. on Wireless Communicaions, vol. 7, no. 8, pp , Aug [8] M. L. Puerman, Markov Decision Processes: Discree Sochasic Dynamic Programming. New York: John Wiley & Sons, [9] R. Jain, Channel models: a uorial, Available online: hp:// [10] B. Sirkeci-Mergen and A. Scaglione, Randomized space-ime coding for disribued cooperaive communicaion, IEEE Trans. on Signal Processing, vol. 55, pp , Oc [11] M. Franklin and M. Reier, Fair exchange wih a semi-rused hird pary, ACM conference on Compuer and Communicaion Securiy, [12] Y. Zhao, R. Adve, T. J. Lim, Improving amplify-and-forward relay neworks: opimal power allocaion versus selecion, IEEE Trans. Wireless Commun., vol. 6, no. 8, V. CONCLUSION Cooperaive communicaion and D2D echnologies are cenral o he LTE-Advanced sandard. However, hese echnologies assume ha users are willing o share heir limied baery energy o relay daa for oher users, even when doing so may reduce heir uiliy. In his paper, we use oken exchanges o provide self-ineresed users wih incenives o relay daa for oher users. We formulae each device s decision problem, i.e., wheher or no o relay, as a Markov decision process. We propose a simple and effecive learning algorihm ha a device can deploy o learn is opimal cooperaion sraegy online based on is experience. We evaluae he nework performance in various deploymen scenarios involving boh high mobiliy and low mobiliy users. Our simulaion resuls indicae ha individual devices have he greaes incenive o cooperae when he nework conains many highly mobile users (e.g., users in moor vehicles). REFERENCES [1] T. C.-Y Ng and W. Yu, Join opimizaion of relay sraegies and resource allocaions in cooperaive cellular neworks, IEEE Trans. on Selec. Areas in Comm., vol. 25, no. 2, Feb Fig. 3. Impac of differen mobiliy mixures on differen user classes. (a) Average ORDR λ. (b) Average IRDR µ. (c) Average modified relay recruimen efficiency e Pr( k> 0, p> 0). (d) Average hroughpu gain

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature On Measuring Pro-Poor Growh 1. On Various Ways of Measuring Pro-Poor Growh: A Shor eview of he Lieraure During he pas en years or so here have been various suggesions concerning he way one should check

More information

20. Applications of the Genetic-Drift Model

20. Applications of the Genetic-Drift Model 0. Applicaions of he Geneic-Drif Model 1) Deermining he probabiliy of forming any paricular combinaion of genoypes in he nex generaion: Example: If he parenal allele frequencies are p 0 = 0.35 and q 0

More information

Resource Allocation in Visible Light Communication Networks NOMA vs. OFDMA Transmission Techniques

Resource Allocation in Visible Light Communication Networks NOMA vs. OFDMA Transmission Techniques Resource Allocaion in Visible Ligh Communicaion Neworks NOMA vs. OFDMA Transmission Techniques Eirini Eleni Tsiropoulou, Iakovos Gialagkolidis, Panagiois Vamvakas, and Symeon Papavassiliou Insiue of Communicaions

More information

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal

More information

Solutions for Assignment 2

Solutions for Assignment 2 Faculy of rs and Science Universiy of Torono CSC 358 - Inroducion o Compuer Neworks, Winer 218 Soluions for ssignmen 2 Quesion 1 (2 Poins): Go-ack n RQ In his quesion, we review how Go-ack n RQ can be

More information

Lecture 4 Notes (Little s Theorem)

Lecture 4 Notes (Little s Theorem) Lecure 4 Noes (Lile s Theorem) This lecure concerns one of he mos imporan (and simples) heorems in Queuing Theory, Lile s Theorem. More informaion can be found in he course book, Bersekas & Gallagher,

More information

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H. ACE 56 Fall 005 Lecure 5: he Simple Linear Regression Model: Sampling Properies of he Leas Squares Esimaors by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Inference in he Simple

More information

Online Appendix to Solution Methods for Models with Rare Disasters

Online Appendix to Solution Methods for Models with Rare Disasters Online Appendix o Soluion Mehods for Models wih Rare Disasers Jesús Fernández-Villaverde and Oren Levinal In his Online Appendix, we presen he Euler condiions of he model, we develop he pricing Calvo block,

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

Random Walk with Anti-Correlated Steps

Random Walk with Anti-Correlated Steps Random Walk wih Ani-Correlaed Seps John Noga Dirk Wagner 2 Absrac We conjecure he expeced value of random walks wih ani-correlaed seps o be exacly. We suppor his conjecure wih 2 plausibiliy argumens and

More information

Energy Storage Benchmark Problems

Energy Storage Benchmark Problems Energy Sorage Benchmark Problems Daniel F. Salas 1,3, Warren B. Powell 2,3 1 Deparmen of Chemical & Biological Engineering 2 Deparmen of Operaions Research & Financial Engineering 3 Princeon Laboraory

More information

Transmitting important bits and sailing high radio waves: a decentralized cross-layer approach to cooperative video transmission

Transmitting important bits and sailing high radio waves: a decentralized cross-layer approach to cooperative video transmission Transmiing imporan bis and sailing high radio waves: a decenralized cross-layer approach o cooperaive video ransmission 1 Nicholas Masronarde, Francesco Verde, Donaella Darsena, Anna Scaglione, and Mihaela

More information

Object tracking: Using HMMs to estimate the geographical location of fish

Object tracking: Using HMMs to estimate the geographical location of fish Objec racking: Using HMMs o esimae he geographical locaion of fish 02433 - Hidden Markov Models Marin Wæver Pedersen, Henrik Madsen Course week 13 MWP, compiled June 8, 2011 Objecive: Locae fish from agging

More information

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon 3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of

More information

Block Diagram of a DCS in 411

Block Diagram of a DCS in 411 Informaion source Forma A/D From oher sources Pulse modu. Muliplex Bandpass modu. X M h: channel impulse response m i g i s i Digial inpu Digial oupu iming and synchronizaion Digial baseband/ bandpass

More information

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still. Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in

More information

Biol. 356 Lab 8. Mortality, Recruitment, and Migration Rates

Biol. 356 Lab 8. Mortality, Recruitment, and Migration Rates Biol. 356 Lab 8. Moraliy, Recruimen, and Migraion Raes (modified from Cox, 00, General Ecology Lab Manual, McGraw Hill) Las week we esimaed populaion size hrough several mehods. One assumpion of all hese

More information

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

Expert Advice for Amateurs

Expert Advice for Amateurs Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he

More information

E β t log (C t ) + M t M t 1. = Y t + B t 1 P t. B t 0 (3) v t = P tc t M t Question 1. Find the FOC s for an optimum in the agent s problem.

E β t log (C t ) + M t M t 1. = Y t + B t 1 P t. B t 0 (3) v t = P tc t M t Question 1. Find the FOC s for an optimum in the agent s problem. Noes, M. Krause.. Problem Se 9: Exercise on FTPL Same model as in paper and lecure, only ha one-period govenmen bonds are replaced by consols, which are bonds ha pay one dollar forever. I has curren marke

More information

Comparing Means: t-tests for One Sample & Two Related Samples

Comparing Means: t-tests for One Sample & Two Related Samples Comparing Means: -Tess for One Sample & Two Relaed Samples Using he z-tes: Assumpions -Tess for One Sample & Two Relaed Samples The z-es (of a sample mean agains a populaion mean) is based on he assumpion

More information

Lecture Notes 5: Investment

Lecture Notes 5: Investment Lecure Noes 5: Invesmen Zhiwei Xu (xuzhiwei@sju.edu.cn) Invesmen decisions made by rms are one of he mos imporan behaviors in he economy. As he invesmen deermines how he capials accumulae along he ime,

More information

Air Traffic Forecast Empirical Research Based on the MCMC Method

Air Traffic Forecast Empirical Research Based on the MCMC Method Compuer and Informaion Science; Vol. 5, No. 5; 0 ISSN 93-8989 E-ISSN 93-8997 Published by Canadian Cener of Science and Educaion Air Traffic Forecas Empirical Research Based on he MCMC Mehod Jian-bo Wang,

More information

Problem Set 5. Graduate Macro II, Spring 2017 The University of Notre Dame Professor Sims

Problem Set 5. Graduate Macro II, Spring 2017 The University of Notre Dame Professor Sims Problem Se 5 Graduae Macro II, Spring 2017 The Universiy of Nore Dame Professor Sims Insrucions: You may consul wih oher members of he class, bu please make sure o urn in your own work. Where applicable,

More information

Inventory Control of Perishable Items in a Two-Echelon Supply Chain

Inventory Control of Perishable Items in a Two-Echelon Supply Chain Journal of Indusrial Engineering, Universiy of ehran, Special Issue,, PP. 69-77 69 Invenory Conrol of Perishable Iems in a wo-echelon Supply Chain Fariborz Jolai *, Elmira Gheisariha and Farnaz Nojavan

More information

Chapter 2. First Order Scalar Equations

Chapter 2. First Order Scalar Equations Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.

More information

Lecture Notes 3: Quantitative Analysis in DSGE Models: New Keynesian Model

Lecture Notes 3: Quantitative Analysis in DSGE Models: New Keynesian Model Lecure Noes 3: Quaniaive Analysis in DSGE Models: New Keynesian Model Zhiwei Xu, Email: xuzhiwei@sju.edu.cn The moneary policy plays lile role in he basic moneary model wihou price sickiness. We now urn

More information

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance

More information

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Simulaion-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Week Descripion Reading Maerial 2 Compuer Simulaion of Dynamic Models Finie Difference, coninuous saes, discree ime Simple Mehods Euler Trapezoid

More information

Recursive Estimation and Identification of Time-Varying Long- Term Fading Channels

Recursive Estimation and Identification of Time-Varying Long- Term Fading Channels Recursive Esimaion and Idenificaion of ime-varying Long- erm Fading Channels Mohammed M. Olama, Kiran K. Jaladhi, Seddi M. Djouadi, and Charalambos D. Charalambous 2 Universiy of ennessee Deparmen of Elecrical

More information

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II Roland Siegwar Margaria Chli Paul Furgale Marco Huer Marin Rufli Davide Scaramuzza ETH Maser Course: 151-0854-00L Auonomous Mobile Robos Localizaion II ACT and SEE For all do, (predicion updae / ACT),

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

Essential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems

Essential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems Essenial Microeconomics -- 6.5: OPIMAL CONROL Consider he following class of opimizaion problems Max{ U( k, x) + U+ ( k+ ) k+ k F( k, x)}. { x, k+ } = In he language of conrol heory, he vecor k is he vecor

More information

Applying Genetic Algorithms for Inventory Lot-Sizing Problem with Supplier Selection under Storage Capacity Constraints

Applying Genetic Algorithms for Inventory Lot-Sizing Problem with Supplier Selection under Storage Capacity Constraints IJCSI Inernaional Journal of Compuer Science Issues, Vol 9, Issue 1, No 1, January 2012 wwwijcsiorg 18 Applying Geneic Algorihms for Invenory Lo-Sizing Problem wih Supplier Selecion under Sorage Capaciy

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

. Now define y j = log x j, and solve the iteration.

. Now define y j = log x j, and solve the iteration. Problem 1: (Disribued Resource Allocaion (ALOHA!)) (Adaped from M& U, Problem 5.11) In his problem, we sudy a simple disribued proocol for allocaing agens o shared resources, wherein agens conend for resources

More information

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model Modal idenificaion of srucures from roving inpu daa by means of maximum likelihood esimaion of he sae space model J. Cara, J. Juan, E. Alarcón Absrac The usual way o perform a forced vibraion es is o fix

More information

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.

More information

Macroeconomic Theory Ph.D. Qualifying Examination Fall 2005 ANSWER EACH PART IN A SEPARATE BLUE BOOK. PART ONE: ANSWER IN BOOK 1 WEIGHT 1/3

Macroeconomic Theory Ph.D. Qualifying Examination Fall 2005 ANSWER EACH PART IN A SEPARATE BLUE BOOK. PART ONE: ANSWER IN BOOK 1 WEIGHT 1/3 Macroeconomic Theory Ph.D. Qualifying Examinaion Fall 2005 Comprehensive Examinaion UCLA Dep. of Economics You have 4 hours o complee he exam. There are hree pars o he exam. Answer all pars. Each par has

More information

Performance of Stochastically Intermittent Sensors in Detecting a Target Traveling between Two Areas

Performance of Stochastically Intermittent Sensors in Detecting a Target Traveling between Two Areas American Journal of Operaions Research, 06, 6, 99- Published Online March 06 in SciRes. hp://www.scirp.org/journal/ajor hp://dx.doi.org/0.436/ajor.06.60 Performance of Sochasically Inermien Sensors in

More information

KINEMATICS IN ONE DIMENSION

KINEMATICS IN ONE DIMENSION KINEMATICS IN ONE DIMENSION PREVIEW Kinemaics is he sudy of how hings move how far (disance and displacemen), how fas (speed and velociy), and how fas ha how fas changes (acceleraion). We say ha an objec

More information

) were both constant and we brought them from under the integral.

) were both constant and we brought them from under the integral. YIELD-PER-RECRUIT (coninued The yield-per-recrui model applies o a cohor, bu we saw in he Age Disribuions lecure ha he properies of a cohor do no apply in general o a collecion of cohors, which is wha

More information

Module 2 F c i k c s la l w a s o s f dif di fusi s o i n

Module 2 F c i k c s la l w a s o s f dif di fusi s o i n Module Fick s laws of diffusion Fick s laws of diffusion and hin film soluion Adolf Fick (1855) proposed: d J α d d d J (mole/m s) flu (m /s) diffusion coefficien and (mole/m 3 ) concenraion of ions, aoms

More information

Lecture 4 Kinetics of a particle Part 3: Impulse and Momentum

Lecture 4 Kinetics of a particle Part 3: Impulse and Momentum MEE Engineering Mechanics II Lecure 4 Lecure 4 Kineics of a paricle Par 3: Impulse and Momenum Linear impulse and momenum Saring from he equaion of moion for a paricle of mass m which is subjeced o an

More information

WEEK-3 Recitation PHYS 131. of the projectile s velocity remains constant throughout the motion, since the acceleration a x

WEEK-3 Recitation PHYS 131. of the projectile s velocity remains constant throughout the motion, since the acceleration a x WEEK-3 Reciaion PHYS 131 Ch. 3: FOC 1, 3, 4, 6, 14. Problems 9, 37, 41 & 71 and Ch. 4: FOC 1, 3, 5, 8. Problems 3, 5 & 16. Feb 8, 018 Ch. 3: FOC 1, 3, 4, 6, 14. 1. (a) The horizonal componen of he projecile

More information

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD HAN XIAO 1. Penalized Leas Squares Lasso solves he following opimizaion problem, ˆβ lasso = arg max β R p+1 1 N y i β 0 N x ij β j β j (1.1) for some 0.

More information

ACE 562 Fall Lecture 4: Simple Linear Regression Model: Specification and Estimation. by Professor Scott H. Irwin

ACE 562 Fall Lecture 4: Simple Linear Regression Model: Specification and Estimation. by Professor Scott H. Irwin ACE 56 Fall 005 Lecure 4: Simple Linear Regression Model: Specificaion and Esimaion by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Simple Regression: Economic and Saisical Model

More information

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC This documen was generaed a :45 PM 8/8/04 Copyrigh 04 Richard T. Woodward. An inroducion o dynamic opimizaion -- Opimal Conrol and Dynamic Programming AGEC 637-04 I. Overview of opimizaion Opimizaion is

More information

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux Gues Lecures for Dr. MacFarlane s EE3350 Par Deux Michael Plane Mon., 08-30-2010 Wrie name in corner. Poin ou his is a review, so I will go faser. Remind hem o go lisen o online lecure abou geing an A

More information

On the Vulnerability of the Proportional Fairness Scheduler to Retransmission Attacks

On the Vulnerability of the Proportional Fairness Scheduler to Retransmission Attacks On he Vulnerabiliy of he Proporional Fairness Scheduler o Reransmission Aacks Udi Ben-Pora, Ana Bremler-Barr, Hanoch Levy and Bernhard Planer Compuer Engineering and Neworks Laboraory, ETH Zurich, Swizerland

More information

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach 1 Decenralized Sochasic Conrol wih Parial Hisory Sharing: A Common Informaion Approach Ashuosh Nayyar, Adiya Mahajan and Demoshenis Tenekezis arxiv:1209.1695v1 [cs.sy] 8 Sep 2012 Absrac A general model

More information

CHAPTER 2 Signals And Spectra

CHAPTER 2 Signals And Spectra CHAPER Signals And Specra Properies of Signals and Noise In communicaion sysems he received waveform is usually caegorized ino he desired par conaining he informaion, and he undesired par. he desired par

More information

d 1 = c 1 b 2 - b 1 c 2 d 2 = c 1 b 3 - b 1 c 3

d 1 = c 1 b 2 - b 1 c 2 d 2 = c 1 b 3 - b 1 c 3 and d = c b - b c c d = c b - b c c This process is coninued unil he nh row has been compleed. The complee array of coefficiens is riangular. Noe ha in developing he array an enire row may be divided or

More information

Linear Response Theory: The connection between QFT and experiments

Linear Response Theory: The connection between QFT and experiments Phys540.nb 39 3 Linear Response Theory: The connecion beween QFT and experimens 3.1. Basic conceps and ideas Q: How do we measure he conduciviy of a meal? A: we firs inroduce a weak elecric field E, and

More information

ACE 562 Fall Lecture 8: The Simple Linear Regression Model: R 2, Reporting the Results and Prediction. by Professor Scott H.

ACE 562 Fall Lecture 8: The Simple Linear Regression Model: R 2, Reporting the Results and Prediction. by Professor Scott H. ACE 56 Fall 5 Lecure 8: The Simple Linear Regression Model: R, Reporing he Resuls and Predicion by Professor Sco H. Irwin Required Readings: Griffihs, Hill and Judge. "Explaining Variaion in he Dependen

More information

Evaluation of Mean Time to System Failure of a Repairable 3-out-of-4 System with Online Preventive Maintenance

Evaluation of Mean Time to System Failure of a Repairable 3-out-of-4 System with Online Preventive Maintenance American Journal of Applied Mahemaics and Saisics, 0, Vol., No., 9- Available online a hp://pubs.sciepub.com/ajams/// Science and Educaion Publishing DOI:0.69/ajams--- Evaluaion of Mean Time o Sysem Failure

More information

A Shooting Method for A Node Generation Algorithm

A Shooting Method for A Node Generation Algorithm A Shooing Mehod for A Node Generaion Algorihm Hiroaki Nishikawa W.M.Keck Foundaion Laboraory for Compuaional Fluid Dynamics Deparmen of Aerospace Engineering, Universiy of Michigan, Ann Arbor, Michigan

More information

Lecture 2 October ε-approximation of 2-player zero-sum games

Lecture 2 October ε-approximation of 2-player zero-sum games Opimizaion II Winer 009/10 Lecurer: Khaled Elbassioni Lecure Ocober 19 1 ε-approximaion of -player zero-sum games In his lecure we give a randomized ficiious play algorihm for obaining an approximae soluion

More information

5. Stochastic processes (1)

5. Stochastic processes (1) Lec05.pp S-38.45 - Inroducion o Teleraffic Theory Spring 2005 Conens Basic conceps Poisson process 2 Sochasic processes () Consider some quaniy in a eleraffic (or any) sysem I ypically evolves in ime randomly

More information

Approximation Algorithms for Unique Games via Orthogonal Separators

Approximation Algorithms for Unique Games via Orthogonal Separators Approximaion Algorihms for Unique Games via Orhogonal Separaors Lecure noes by Konsanin Makarychev. Lecure noes are based on he papers [CMM06a, CMM06b, LM4]. Unique Games In hese lecure noes, we define

More information

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN The MIT Press, 2014 Lecure Slides for INTRODUCTION TO MACHINE LEARNING 3RD EDITION alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/~ehem/i2ml3e CHAPTER 2: SUPERVISED LEARNING Learning a Class

More information

Online Appendix for "Customer Recognition in. Experience versus Inspection Good Markets"

Online Appendix for Customer Recognition in. Experience versus Inspection Good Markets Online Appendix for "Cusomer Recogniion in Experience versus Inspecion Good Markes" Bing Jing Cheong Kong Graduae School of Business Beijing, 0078, People s Republic of China, bjing@ckgsbeducn November

More information

GMM - Generalized Method of Moments

GMM - Generalized Method of Moments GMM - Generalized Mehod of Momens Conens GMM esimaion, shor inroducion 2 GMM inuiion: Maching momens 2 3 General overview of GMM esimaion. 3 3. Weighing marix...........................................

More information

EE650R: Reliability Physics of Nanoelectronic Devices Lecture 9:

EE650R: Reliability Physics of Nanoelectronic Devices Lecture 9: EE65R: Reliabiliy Physics of anoelecronic Devices Lecure 9: Feaures of Time-Dependen BTI Degradaion Dae: Sep. 9, 6 Classnoe Lufe Siddique Review Animesh Daa 9. Background/Review: BTI is observed when he

More information

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing Applicaion of a Sochasic-Fuzzy Approach o Modeling Opimal Discree Time Dynamical Sysems by Using Large Scale Daa Processing AA WALASZE-BABISZEWSA Deparmen of Compuer Engineering Opole Universiy of Technology

More information

Article from. Predictive Analytics and Futurism. July 2016 Issue 13

Article from. Predictive Analytics and Futurism. July 2016 Issue 13 Aricle from Predicive Analyics and Fuurism July 6 Issue An Inroducion o Incremenal Learning By Qiang Wu and Dave Snell Machine learning provides useful ools for predicive analyics The ypical machine learning

More information

Reading from Young & Freedman: For this topic, read sections 25.4 & 25.5, the introduction to chapter 26 and sections 26.1 to 26.2 & 26.4.

Reading from Young & Freedman: For this topic, read sections 25.4 & 25.5, the introduction to chapter 26 and sections 26.1 to 26.2 & 26.4. PHY1 Elecriciy Topic 7 (Lecures 1 & 11) Elecric Circuis n his opic, we will cover: 1) Elecromoive Force (EMF) ) Series and parallel resisor combinaions 3) Kirchhoff s rules for circuis 4) Time dependence

More information

Maintenance Models. Prof. Robert C. Leachman IEOR 130, Methods of Manufacturing Improvement Spring, 2011

Maintenance Models. Prof. Robert C. Leachman IEOR 130, Methods of Manufacturing Improvement Spring, 2011 Mainenance Models Prof Rober C Leachman IEOR 3, Mehods of Manufacuring Improvemen Spring, Inroducion The mainenance of complex equipmen ofen accouns for a large porion of he coss associaed wih ha equipmen

More information

A Dynamic Model of Economic Fluctuations

A Dynamic Model of Economic Fluctuations CHAPTER 15 A Dynamic Model of Economic Flucuaions Modified for ECON 2204 by Bob Murphy 2016 Worh Publishers, all righs reserved IN THIS CHAPTER, OU WILL LEARN: how o incorporae dynamics ino he AD-AS model

More information

A Hop Constrained Min-Sum Arborescence with Outage Costs

A Hop Constrained Min-Sum Arborescence with Outage Costs A Hop Consrained Min-Sum Arborescence wih Ouage Coss Rakesh Kawara Minnesoa Sae Universiy, Mankao, MN 56001 Email: Kawara@mnsu.edu Absrac The hop consrained min-sum arborescence wih ouage coss problem

More information

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017 Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =

More information

1904 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 58, NO. 4, MAY 2009

1904 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 58, NO. 4, MAY 2009 1904 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 58, NO. 4, MAY 2009 Learning o Compee for Resources in Wireless Sochasic Games Fangwen Fu, Suden Member, IEEE, and Mihaela van der Schaar, Senior Member,

More information

Appendix to Creating Work Breaks From Available Idleness

Appendix to Creating Work Breaks From Available Idleness Appendix o Creaing Work Breaks From Available Idleness Xu Sun and Ward Whi Deparmen of Indusrial Engineering and Operaions Research, Columbia Universiy, New York, NY, 127; {xs2235,ww24}@columbia.edu Sepember

More information

The Arcsine Distribution

The Arcsine Distribution The Arcsine Disribuion Chris H. Rycrof Ocober 6, 006 A common heme of he class has been ha he saisics of single walker are ofen very differen from hose of an ensemble of walkers. On he firs homework, we

More information

Designing Information Devices and Systems I Spring 2019 Lecture Notes Note 17

Designing Information Devices and Systems I Spring 2019 Lecture Notes Note 17 EES 16A Designing Informaion Devices and Sysems I Spring 019 Lecure Noes Noe 17 17.1 apaciive ouchscreen In he las noe, we saw ha a capacior consiss of wo pieces on conducive maerial separaed by a nonconducive

More information

Bias-Variance Error Bounds for Temporal Difference Updates

Bias-Variance Error Bounds for Temporal Difference Updates Bias-Variance Bounds for Temporal Difference Updaes Michael Kearns AT&T Labs mkearns@research.a.com Sainder Singh AT&T Labs baveja@research.a.com Absrac We give he firs rigorous upper bounds on he error

More information

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes Some common engineering funcions 2.7 Inroducion This secion provides a caalogue of some common funcions ofen used in Science and Engineering. These include polynomials, raional funcions, he modulus funcion

More information

On-line Adaptive Optimal Timing Control of Switched Systems

On-line Adaptive Optimal Timing Control of Switched Systems On-line Adapive Opimal Timing Conrol of Swiched Sysems X.C. Ding, Y. Wardi and M. Egersed Absrac In his paper we consider he problem of opimizing over he swiching imes for a muli-modal dynamic sysem when

More information

Cash Flow Valuation Mode Lin Discrete Time

Cash Flow Valuation Mode Lin Discrete Time IOSR Journal of Mahemaics (IOSR-JM) e-issn: 2278-5728,p-ISSN: 2319-765X, 6, Issue 6 (May. - Jun. 2013), PP 35-41 Cash Flow Valuaion Mode Lin Discree Time Olayiwola. M. A. and Oni, N. O. Deparmen of Mahemaics

More information

Delivering Better Time-of-Day Using Synchronous Ethernet and Yaakov (J) Stein, Alon Geva, Gabriel Zigelboim RAD Data Communications

Delivering Better Time-of-Day Using Synchronous Ethernet and Yaakov (J) Stein, Alon Geva, Gabriel Zigelboim RAD Data Communications Delivering Beer Time-of-Day Using Synchronous Eherne and 1588 Yaakov (J) Sein, Alon Geva, Gabriel Zigelboim RAD Daa Communicaions The problem I wan discuss he use of 1588 (or NTP for ha maer) for ime of

More information

Explaining Total Factor Productivity. Ulrich Kohli University of Geneva December 2015

Explaining Total Factor Productivity. Ulrich Kohli University of Geneva December 2015 Explaining Toal Facor Produciviy Ulrich Kohli Universiy of Geneva December 2015 Needed: A Theory of Toal Facor Produciviy Edward C. Presco (1998) 2 1. Inroducion Toal Facor Produciviy (TFP) has become

More information

Seminar 4: Hotelling 2

Seminar 4: Hotelling 2 Seminar 4: Hoelling 2 November 3, 211 1 Exercise Par 1 Iso-elasic demand A non renewable resource of a known sock S can be exraced a zero cos. Demand for he resource is of he form: D(p ) = p ε ε > A a

More information

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality Marix Versions of Some Refinemens of he Arihmeic-Geomeric Mean Inequaliy Bao Qi Feng and Andrew Tonge Absrac. We esablish marix versions of refinemens due o Alzer ], Carwrigh and Field 4], and Mercer 5]

More information

Unemployment and Mismatch in the UK

Unemployment and Mismatch in the UK Unemploymen and Mismach in he UK Jennifer C. Smih Universiy of Warwick, UK CAGE (Cenre for Compeiive Advanage in he Global Economy) BoE/LSE Conference on Macroeconomics and Moneary Policy: Unemploymen,

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION DOI: 0.038/NCLIMATE893 Temporal resoluion and DICE * Supplemenal Informaion Alex L. Maren and Sephen C. Newbold Naional Cener for Environmenal Economics, US Environmenal Proecion

More information

Cooperative Ph.D. Program in School of Economic Sciences and Finance QUALIFYING EXAMINATION IN MACROECONOMICS. August 8, :45 a.m. to 1:00 p.m.

Cooperative Ph.D. Program in School of Economic Sciences and Finance QUALIFYING EXAMINATION IN MACROECONOMICS. August 8, :45 a.m. to 1:00 p.m. Cooperaive Ph.D. Program in School of Economic Sciences and Finance QUALIFYING EXAMINATION IN MACROECONOMICS Augus 8, 213 8:45 a.m. o 1: p.m. THERE ARE FIVE QUESTIONS ANSWER ANY FOUR OUT OF FIVE PROBLEMS.

More information

Time series Decomposition method

Time series Decomposition method Time series Decomposiion mehod A ime series is described using a mulifacor model such as = f (rend, cyclical, seasonal, error) = f (T, C, S, e) Long- Iner-mediaed Seasonal Irregular erm erm effec, effec,

More information

INDEX. Transient analysis 1 Initial Conditions 1

INDEX. Transient analysis 1 Initial Conditions 1 INDEX Secion Page Transien analysis 1 Iniial Condiions 1 Please inform me of your opinion of he relaive emphasis of he review maerial by simply making commens on his page and sending i o me a: Frank Mera

More information

MANAGEMENT SCIENCE doi /mnsc ec pp. ec1 ec20

MANAGEMENT SCIENCE doi /mnsc ec pp. ec1 ec20 MANAGEMENT SCIENCE doi.287/mnsc.7.82ec pp. ec ec2 e-companion ONLY AVAILABLE IN ELECTRONIC FORM informs 28 INFORMS Elecronic Companion Saffing of Time-Varying Queues o Achieve Time-Sable Performance by

More information

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits DOI: 0.545/mjis.07.5009 Exponenial Weighed Moving Average (EWMA) Char Under The Assumpion of Moderaeness And Is 3 Conrol Limis KALPESH S TAILOR Assisan Professor, Deparmen of Saisics, M. K. Bhavnagar Universiy,

More information

Some Basic Information about M-S-D Systems

Some Basic Information about M-S-D Systems Some Basic Informaion abou M-S-D Sysems 1 Inroducion We wan o give some summary of he facs concerning unforced (homogeneous) and forced (non-homogeneous) models for linear oscillaors governed by second-order,

More information

Unit Root Time Series. Univariate random walk

Unit Root Time Series. Univariate random walk Uni Roo ime Series Univariae random walk Consider he regression y y where ~ iid N 0, he leas squares esimae of is: ˆ yy y y yy Now wha if = If y y hen le y 0 =0 so ha y j j If ~ iid N 0, hen y ~ N 0, he

More information

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC This documen was generaed a :37 PM, 1/11/018 Copyrigh 018 Richard T. Woodward 1. An inroducion o dynamic opimiaion -- Opimal Conrol and Dynamic Programming AGEC 64-018 I. Overview of opimiaion Opimiaion

More information

control properties under both Gaussian and burst noise conditions. In the ~isappointing in comparison with convolutional code systems designed

control properties under both Gaussian and burst noise conditions. In the ~isappointing in comparison with convolutional code systems designed 535 SOFT-DECSON THRESHOLD DECODNG OF CONVOLUTONAL CODES R.M.F. Goodman*, B.Sc., Ph.D. W.H. Ng*, M.S.E.E. Sunnnary Exising majoriy-decision hreshold decoders have so far been limied o his paper a new mehod

More information

13.3 Term structure models

13.3 Term structure models 13.3 Term srucure models 13.3.1 Expecaions hypohesis model - Simples "model" a) shor rae b) expecaions o ge oher prices Resul: y () = 1 h +1 δ = φ( δ)+ε +1 f () = E (y +1) (1) =δ + φ( δ) f (3) = E (y +)

More information

Echocardiography Project and Finite Fourier Series

Echocardiography Project and Finite Fourier Series Echocardiography Projec and Finie Fourier Series 1 U M An echocardiagram is a plo of how a porion of he hear moves as he funcion of ime over he one or more hearbea cycles If he hearbea repeas iself every

More information

Stable Scheduling Policies for Maximizing Throughput in Generalized Constrained Queueing Systems

Stable Scheduling Policies for Maximizing Throughput in Generalized Constrained Queueing Systems 1 Sable Scheduling Policies for Maximizing Throughpu in Generalized Consrained Queueing Sysems Prasanna Chaporar, Suden Member, IEEE, Saswai Sarar, Member, IEEE Absrac We consider a class of queueing newors

More information

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8)

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8) I. Definiions and Problems A. Perfec Mulicollineariy Econ7 Applied Economerics Topic 7: Mulicollineariy (Sudenmund, Chaper 8) Definiion: Perfec mulicollineariy exiss in a following K-variable regression

More information

EECS 141: FALL 00 MIDTERM 2

EECS 141: FALL 00 MIDTERM 2 Universiy of California College of Engineering Deparmen of Elecrical Engineering and Compuer Science J. M. Rabaey TuTh9:30-11am ee141@eecs EECS 141: FALL 00 MIDTERM 2 For all problems, you can assume he

More information