Foresighted Resource Reciprocation Strategies in P2P Networks

Size: px
Start display at page:

Download "Foresighted Resource Reciprocation Strategies in P2P Networks"

Transcription

1 Foreghted Reource Recprocaton Stratege n PP Networ Hyunggon Par and Mhaela van der Schaar Electrcal Engneerng Department Unverty of Calforna Lo Angele (UCLA) Emal: {hgpar mhaela@ee.ucla.edu Abtract We conder peer-to-peer (PP) networ where multple peer are ntereted n harng content. Whle harng reource autonomou and elf-ntereted peer need to mae decon on the amount of ther reource recprocaton (.e. repreentng ther acton) uch that ther ndvdual reward are maxmzed. We model the reource recprocaton among the peer a a tochatc game and how how the peer can determne ther optmal tratege for the acton ung a Marov Decon Proce (MDP) framewor. The optmal tratege determned baed on MDP enable the peer to mae foreghted decon about reource recprocaton uch that they can explctly conder both ther mmedate a well a future expected reward. To uccefully formulate the MDP framewor we propoe a novel algorthm that effcently dentfe the tate tranton probablte ung repreentatve reource recprocaton model of peer. Smulaton reult how that the propoed approach baed on the recprocaton model can effectvely cope wth a dynamcally changng envronment of PP networ. Moreover we how that the foreghted decon lead to the bet performance n term of the cumulatve expected reward. I. INTRODUCTION PP applcaton (e.g [1] []) have become ncreangly popular and PP networ provde a cot effectve and ealy deployable framewor for demnatng large fle wthout relyng on a centralzed nfratructure [3]. Hence t ha been recently propoed to ue PP networ for general fle harng [] [4] or multmeda treamng [3] [5] [6]. In th paper we conder data-drven PP ytem uch a CoolStreamng [5] Chanaw [6] or BtTorrent ytem [4] whch adopt pull-baed technque [5] [6]. Whle th approach ha been uccefully deployed n real-tme multmeda treamng and fle dtrbuton over PP networ ey challenge uch a reource recprocaton among autonomou and elf-ntereted peer tll reman open. The reource recprocaton trategy deployed n BtTorrent (.e. Tt-for-Tat (TFT) trategy) dtrbute the bandwdth equally: a peer equally dvde t avalable upload bandwdth among multple leecher [4]. The reource recprocaton n [5] baed on a heurtc chedulng algorthm whch enable the peer to determne the uppler of requred chun and elect the peer wth the hghet bandwdth. Alternatvely the reource recprocaton can be baed on the random chun electon [6]. Snce the oluton n [5] and [6] are mplemented aumng that the aocated peer are altrutc the reource recprocaton tratege baed on thee oluton do not conder the nteracton of the autonomou elfntereted and heterogeneou peer. We model the reource recprocaton among the ntereted peer a a tochatc game where peer decde ther reource dtrbuton by explctly conderng the probabltc behavor (recprocaton) of ther aocated peer. Unle extng reource recprocaton tratege whch focu on myopc decon we formalze the reource recprocaton game a a Marov Decon Proce (MDP) [7] to enable peer to mae foreghted decon on ther reource dtrbuton n a way that maxmze ther cumulatve reward.e. ther mmedate a well a future reward. To uccefully formulate the reource recprocaton game a an MDP problem the peer need to dentfy the aocated peer probabltc behavor for reource recprocaton. The probabltc behavor of the aocated peer can be etmated ung the pat htory of reource recprocaton and are repreented by tate tranton probablte n the MDP framewor. In th paper the tate of a peer defned a the et of receved reource from each of the aocated peer. We propoe a novel algorthm that can effcently dentfy the tate tranton probablte ung peer recprocaton model. The recprocaton model of the peer are motvated by [8] whch clafy the atttude of player n a game toward ther tratege a optmtc pemtc and neutral archetype. We how that th approach for reource recprocaton baed on the recprocaton model provde a fater convergence. Hence th approach enable peer to effcently capture the change of the tate tranton probablte reultng n an effcent oluton for PP networ. Th paper organzed a follow. In Secton II we model the reource recprocaton among peer a a reource recprocaton game. In Secton III the type of peer n the condered PP networ are dcued. In Secton IV an algorthm that determne the tate tranton probablte baed on the recprocaton model propoed. Smulaton reult are provded n Secton V and concluon are drawn n Secton VI. II. A NEW FRAMEWORK FOR RESOURCE RECIPROCATION In th ecton we model the reource recprocaton among the peer a a reource recprocaton game. Reource recprocaton game n the PP networ are played by the peer that are ntereted n each other. A reource recprocaton game played n a group where a group cont of a peer and t aocated peer whch mlar to concept uch a the warm n [4] partnerhp n [5] or neghbor n [6]. We Th full text paper wa peer revewed at the drecton of IEEE Communcaton Socety ubject matter expert for publcaton n the IEEE "GLOBECOM" 8 proceedng.

2 3 Peer x 1 1 a 1 3) State Tranton Probablty P a ( ): A tate tranton probablty repreent the probablty that by tang an acton a peer wll trant nto a new tate. Gven a tate S at tme t an acton a A of peer can lead to another tate S at t +1 wth probablty P a ( )=Pr( a ). Hence for a tate =( 1... NC ) of peer n C the probablty that an acton a lead the tate tranton from to can be expreed a C 1 5 Fg An llutratve example for C wth 5 matched peer. denote the aocated peer n a group of a peer by C and an llutratve example of C depcted n Fg. 1. A hown n Fg. 1 C doe not nclude peer. The peer n C are ndexed by 1...N C.e. C = {1...N C. Note that peer C alo ha t own group C whch nclude peer. The reource recprocaton game n C a tochatc game [7]. To play the reource recprocaton game a peer can deploy an MDP. An MDP of a peer a tuple < S A P R > where S the tate pace A the acton pace P : S A S [ 1] a tate tranton probablty functon and R : S R a reward functon. The detal are explaned a follow. 1) State Space S : A tate of peer repreent the et of receved reource from the peer n C and the tate pace of peer can be expreed a S = { = ψ (x ) Δ = { Δ 1...Δ n C where = ( 1... NC ) and x denote the provded reource (.e. rate) by peer C whch lmted by t maxmum upload bandwdth L. determned by a peer dependent functon ψ : R Δ = {Δ 1...Δn that map x nto one of n dcrete value 1 whch are called tate decrpton n th paper. ) Acton Space A : An acton of peer t reource allocaton to the peer n C. Hence the acton pace of peer n C can be expreed a A = {a a L 1 N C a L C where a =(a 1...a NC ) A = A A and a A denote the allocated reource to peer by peer n C. Hence peer acton a to peer become peer receved reource from peer.e. a = x. We aume that the acton are the number of allocated unt of bandwdth [9] to the aocated peer n ther group. We defne the reource recprocaton (â ŝ ) = ( (â 1...â NC ) (ŝ 1...ŝ NC ) ) a a par comprng peer acton â and the correpondng modeled reource recprocaton ŝ whch determne a ŝ = ψ (ˆx ) for all C. 1 A contnuou value of x can be dcretzed by peer baed on t quantzaton polcy a the bandwdth of each peer can be decompoed nto everal unt of bandwdth by the clent oftware e.g. [9]. P a ( )= N C P a l=1 l ( l l). (1) 4) Reward R : Snce the peer prefer hgher download rate we conder that reward R ( ) for a peer n tate the total receved reource n C expreed a R ( )=R ( 1... NC )= C r ( ) () where r ( ) repreent the receved reource n. 5) Recprocaton Polcy π : TheolutontotheMDP repreented by peer optmal polcy π whch a mappng from the tate to optmal acton. The optmal polcy can be obtaned ung well-nown method uch a value teraton and polcy teraton [7]. Hence peer can decde t acton baed on the optmal polcy π.e. π ( )=a for all S. Note that our focu on the reource recprocaton game n a group a the reource recprocaton game n a PP networ can be condered a multple reource recprocaton game n group. We wll dcu how the optmal polce can be determned for dfferent type of peer n the next ecton. III. PEER TYPES IN PP NETWORKS In th paper we categorze the peer baed on ther adopted utlte and ther reource recprocaton atttude. A. Peer type dependng on ther adopted utlte Peer n the PP networ can be condered a myopc or foreghted baed on how they compute ther reward. A peer n tate (t) myopc f t focue on maxmzng t mmedate expected reward R myo ) defned a R myo ) P (t+1) a S (t+1) )R ( (t+1) ). (3) Hence the myopc peer determne t acton a uch that maxmze t adopted expected reward Rmyo ).e. a a = arg max a A R myo ) ubject to. C a L Unle the myopc peer the foreghted peer tae ther acton uch that the acton maxmze a cumulatve dcounted expected reward [7]. Hence the objectve of a foreghted peer n tate (t) at tme t = t c gven a dcount factor γ can be expreed a R fore where E ) ] = [ R ) t=t γ(t (tc+1)) c+1 P (t+1) a (t+1) S [ ] E R ) (4) (t+1) )R ( (t+1) ). Hence the foreghted peer can determne a et of acton Th full text paper wa peer revewed at the drecton of IEEE Communcaton Socety ubject matter expert for publcaton n the IEEE "GLOBECOM" 8 proceedng.

3 ψ Fg.. L ( ) ˆ ψ () Optmtc Neutral X aˆ A Reource Recprocaton ( aˆ ˆ ) Pemtc Illutraton of reource recprocaton baed on peer atttude. that maxmze R fore ) for every tate n S whch lead to an optmal polcy π. The optmal polcy π thu map each tate S nto a correpondng optmal acton a.e. π ( )=a for all S. From (3) and (4) we can oberve that the myopc decon are a pecal cae of the foreghted decon when γ =. Note that the dcount factor γ n the condered PP networ can alternatvely repreent the belef of the peer about the valdty of the expected future reward nce the tate tranton probablty can be affected by ytem dynamc uch a the other peer jonng wtchng or leavng group. Hence for example f the PP networ n a tranent regme a mall dcount factor derable. However a large dcount factor can be ued f the PP networ n tatonary regme [1]. Thu we aume that the value of the dcount factor can be determned by the peer ung nformaton baed on ther pat experence reputaton of ther aocated peer [11] [1] etc. B. Peer type baed on ther atttude Peer n the condered PP networ can alo be characterzed baed on ther atttude toward the reource recprocaton whch are pemtc neutral or optmtc [8]. Let (â ŝ ) be a reource recprocaton between peer and peer. A peer neutral f t preume that the reource recprocaton change lnearly dependng on t acton. A peer pemtc f t preume that the reource recprocaton decreae fat for a â and ncreae low for a â. On the other hand an optmtc peer preume that the reource recprocaton decreae low for a â and ncreae fat for a â. Illutratve example of reource recprocaton hape that correpond to peer dfferent atttude are hown n Fg.. In th paper we conder thee reource recprocaton profle whch wll be referred to a recprocaton model. Thee type of peer dcued above obvouly affect ther reource recprocaton tratege. In the followng ecton we dcu how the peer atttude can mpact the way n whch peer model the other peer reource recprocaton behavor. L IV. DETERMINING THE STATE TRANSITION PROBABILITIES A. Emprcal Frequency baed State Tranton Probablte A peer can dentfy t tate tranton probablte baed on the frequency of the recprocaton. For th we conder a table T that tore the htory of reource recprocaton for peer gven acton of peer. An element T (Δl1 Δl a ) of the table T denote the number of tate tranton from =Δ l1 to =Δl gven an acton a. Hence the tate tranton probablty P a ( =Δ l1 =Δl ) baed on the emprcal frequency can be expreed a: P a ( =Δ l1 =Δ l )= T (Δl1 Δl a ) n h=1 T (Δl1 Δh a ). A dadvantage of th approach that t may requre a conderable amount of obervaton of the reource recprocaton over tme to accurately dentfy the tate tranton probablte. To reduce the number of requred obervaton we propoe an alternatve algorthm that baed on the reource recprocaton model. B. Recprocaton Model baed State Tranton Probablte A et of the tate tranton probablty that correpond to the reource recprocaton model called recprocaton matrx. The et of m avalable recprocaton matrce of peer n for peer denoted by M ( ) = {M1 ( )...Mm ( ) where Ml ( ) a A n matrx wth t element Ml ( )[a ]=P a ( ).A recprocaton matrx Ml ( ) for a pemtc peer tang acton a n hown n Fg. can be expreed by Ml( )[a ] (5) 1 f a < â = ψ () or a > â = = 1/W P f a =â ψ () otherwe where W P = {l ψ () l the number of tate decrpton between and ψ () and a denote the avalable acton that can be taen. Smlarly a recprocaton matrx of an optmtc peer n for peer hown n Fg. can be repreented by Ml( )[a ] (6) 1 f a < â = or a > â = ψ (L ) = 1/W O f a =â ψ (L ) otherwe where W O = {l l ψ (L ) the number of tate decrpton between and ψ (L ). A neutral peer preume that an acton a â wll lead to lnear change n reource recprocaton. Hence the recprocaton matrx of a neutral peer can be expreed a { Ml( )[a 1 f ] = = ψ (x = α a ) (7) otherwe where α = /â denote the lope determned baed on the current reource recprocaton. In the followng ubecton Th full text paper wa peer revewed at the drecton of IEEE Communcaton Socety ubject matter expert for publcaton n the IEEE "GLOBECOM" 8 proceedng.

4 l l l m 1 ) M M ( ( ) Recprocaton Matrce w 1 ( ) State ( ) m w Tranton m Probablty ) M w ( ( ) l l = 1 Matrx: l ye M ( ) a? no = h a h 1 m H ( ( ( ( ) ) )) = m h ) ) w ( ( h ( ) l l l = l 1 ye h ( ) h ( ) + 1 no Accuracy < Threhold? Weght Update Fg. 3. Bloc dagram for updatng the weght of the recprocaton matrce. D Jl (average) D Jl (average) (a) wthout recprocaton model wth recprocaton model Number of Obervaton (b) wthout recprocaton model 1.5 wth recprocaton model Duraton we propoe an algorthm that ue the dcued recprocaton matrce to effcently dentfy the tate tranton probablty. C. Buldng State Tranton Probablte baed on Recprocaton Matrce We aume that each peer ha a pre-determned ntal acton a I =(a I 1...aI N C ) A that ued for ntalzng the recprocaton matrce.e. a peer ha a pre-determned acton a I A for peer and the reultng. Baed on the ntal reource recprocaton (a I ) between peer and peer the recprocaton matrce of peer can be ntalzed baed on (5) (6) and (7). Let M ( ) be the et of m recprocaton matrce that are ntalzed baed on (a I ). The weght of peer for the recprocaton matrce are denoted by w ( ) = (w1 ( )...wm ( )) for peer C. We defne H ( ) = (h 1 ( )...h m ( )) a the et of number of ht where the reource recprocaton between peer and peer are matched to non-zero element n M ( ). Specfcally f (a ) matched up to a non-zero element n Ml ( )[a ] then h l ( ) h l ( )+1. Baed on H ( ) the weght of recprocaton matrce for peer can be computed a w l( )= h l ( ) m l =1 h l ( ). Th update proce depcted n Fg. 3. Fnally baed on the dentfed weght the probablty P a ( ) can be obtaned by { m P a ( ) = wl( )Ml( ) [a ]. (8) l=1 V. SIMULATION RESULTS A. Comparon of Varou Approache for Identfyng the State Tranton Probablte To llutrate the tradeoff between the effcency and the accuracy we deploy the two approache dcued n Secton IV to dentfy the true tate tranton probablty. Fg. 4. (Averaged) D Jl for etmated tate tranton probablty matrce wth/wthout recprocaton model. In the mulaton we aume that peer and peer are n a group and recprocate ther reource. To tudy the mpact of the two approache on the dcounted expected reward we conder dcounted expected reward J = [ J ( 1 )...J ( n )] T and J = [ J ( 1 )...J ( n of peer from peer obtaned )] T baed on true and etmated tate tranton probablte P ad P and the correpondng optmal polcy π and π repectvely. We aume that P tatonary. Note that J can be computed by J = P r + γ P J [7] or equvalently J =[I γ P ] 1 P r =[P + γ P + γ P 3 + ]r where r = [r( 1 ) r(n )]T. Smlarly J can alo be computed by J = [I γ P ] 1 P r. Wthout lo of generalty we conder a dcounted expected reward from l.e. J ( l ) and J ( l ). To quantfy them we ue a metrc D Jl defned by D Jl = J ( l ) J ( l ). The reult are hown n Fg. 4. In Fg. 4 (a) nce the true tate tranton probablty matrx tatonary f there are enough obervaton of reource recprocaton the true tate tranton probablty matrx can be well-dentfed by the emprcal frequency baed approach. However t may requre the obervaton of reource recprocaton among peer to obtan accurate tate tranton probablte. In contrat the approach baed on the recprocaton model can dentfy the tate tranton probablte baed on fewer obervaton. However unle the emprcal frequency baed approach the mprovement ganed for more obervaton dmnhe rapdly (before reachng the bet performance of the emprcal frequency baed approach) a the etmaton rele on pre-determned recprocaton model. A hown n Fg. 4 (a) the approach baed on the recprocaton model provde a fater convergence whch become mportant when the tate tranton probablte vary over tme. Illutratve mulaton reult are hown n Fg. 4 (b). To emulate a dynamc envronment randomly generated dfferent tate tranton probablty matrce of a peer are deployed ev- Th full text paper wa peer revewed at the drecton of IEEE Communcaton Socety ubject matter expert for publcaton n the IEEE "GLOBECOM" 8 proceedng.

5 7 6 Immedate Expected Reward (a) 5 foreghted decon 4 myopc decon TFT decon State Index Cumulatve Dcounted Reward (γ =.8) (b) State Index Fg. 5. Immedate (a) and cumulatve expected dcounted reward (b) acheved by dfferent polce. ery 1 reource recprocaton. A expected the approach baed on the recprocaton model provde a fater convergence thereby enablng peer to effcently capture the change of the tate tranton probablty. Therefore the propoed approach can cope wth a dynamc envronment thereby mang t more utable than the emprcal frequency baed approach for PP networ. B. Impact of Myopc and Foreghted Polce on Reward The mpact of the myopc and foreghted polce for ther acton on acheved cumulatve expected reward are quantfed. To focu on the mpact of myopc and foreghted polce we aume that the aocated peer behavor are randomly generated and tatonary. The oluton to the MDP mplemented baed on a well-nown polcy teraton method [7]. In addton the TFT trategy mplemented n BtTorrent-le ytem compared whch upport two leecher multaneouly a an llutraton. The mulaton reult are hown n Fg. 5. Fg. 5 how the mmedate and cumulatve expected dcounted reward wth γ =.8 obtaned baed on the myopc foreghted and TFT polce. State ndex repreent avalable tate n the tate pace. To focu on the comparon among the condered polce we aume that the reward that can be obtaned n each tate are normalzed a the tate ndex. We can oberve that the obtaned reward baed on the TFT polcy are the wort nce the acton determned by the TFT polcy do not conder the expected reward. Moreover the contrant of fxed concurrent allowable upload to the leecher can prevent the decon proce from electng better acton. A dcued prevouly the myopc decon are made baed on (3) whch maxmze the mmedate expected reward. Hence we can verfy that the mmedate expected reward obtaned by the acton of myopc polcy are alway hgher (or equal) than the other polce n Fg. 5 (a). However the foreghted decon are made baed on (4) uch that they maxmze the cumulatve dcounted expected reward a hown n Fg. 5 (b). Therefore the foreghted polcy enable the peer to determne ther decon that lead to the hghet cumulatve expected dcounted reward. VI. CONCLUSION In th paper the reource recprocaton among the peer modeled a a recprocaton game and the game formulated baed on the MDP framewor. Hence peer can determne ther reource recprocaton uch that they can maxmze ther cumulatve expected reward. To uccefully formulate the MDP framewor n PP networ we propoe the recprocaton model baed approach that enable peer to effcently dentfy the tate tranton probablty matrx. We tudy the tradeoff between effcency and accuracy when dfferent number of recprocaton model are deployed. In the mulaton we how that the propoed recprocaton baed approach more utable for PP networ. We alo how that the propoed foreghted decon lead to the bet performance n term of the cumulatve expected reward. Therefore f the propoed approach deployed n extng reource recprocaton tratege uch a TFT n BtTorrent t enable peer to mae foreghted decon on ther reource allocaton leadng to performance mprovement. REFERENCES [1] Napter. [Onlne]. Avalable: [] Gnutella. [Onlne]. Avalable: [3] J. Lu S. G. Rao B. L and H. Zhang Opportunte and challenge of peer-to-peer nternet vdeo broadcat Proc. of the IEEE Specal Iue on Recent Advance n Dtrbuted Multmeda Commun. 7. [4] A. Legout N. Loga E. Kohler and L. Zhang Cluterng and harng ncentve n BtTorrent ytem INRIA-1166 Tech. Rep. 1-1 Nov. 6. [5] X. Zhang J. Lu B. L and T. S. P. Yum CoolStreamng/DONet: A data-drven overlay networ for effcent lve meda treamng n Proc. INFOCOM 5 5. [6] V. Pa K. Kumar K. Tamlman V. Sambamurthy and A. E. Mohr Chanaw: Elmnatng tree from overlay multcat n Proc. 4th Int. Worhop on Peer-to-Peer Sytem (IPTPS) Feb. 5. [7] D. P. Bertea Dynamc Programmng and Stochatc Control. Academc P [8] E. Haruvy D. O. Stahl and P. W. Wlon Evdence for optmtc and pemtc behavor n normal-form game Economc Lett. vol. 63 pp [9] K. Jan L. Lovz and P. A. Chou Buldng calable and robut peer-to-peer overlay networ for broadcatng ung networ codng Dtrbuted Computng vol. 19 no. 4 pp [1] G. de Vecana and X. Yang Farne ncentve and performance n peer-to-peer networ n 41th Annual Allerton Conference on Communcaton Control and Computng 3. [11] E. Daman D. C. d Vmercat S. Paraboch P. Samarat and F. Volante A reputaton-baed approach for choong relable reource n peer-to-peer networ n Proc. the 9th ACM Conf. on Comput. and Commun. Securty (CCS ). ACM Pre pp [1] M. Gupta P. Judge and M. Ammar A reputaton ytem for peer-topeer networ n Proc. the 13th Int. Worhop on Netw. and Operatng Sytem Support for Dgtal Audo and Vdeo (NOSSDAV 3). ACM Pre 3 pp Th full text paper wa peer revewed at the drecton of IEEE Communcaton Socety ubject matter expert for publcaton n the IEEE "GLOBECOM" 8 proceedng.

Additional File 1 - Detailed explanation of the expression level CPD

Additional File 1 - Detailed explanation of the expression level CPD Addtonal Fle - Detaled explanaton of the expreon level CPD A mentoned n the man text, the man CPD for the uterng model cont of two ndvdual factor: P( level gen P( level gen P ( level gen 2 (.).. CPD factor

More information

Risk-aware Cooperative Spectrum Access for Multi-Channel Cognitive Radio Networks

Risk-aware Cooperative Spectrum Access for Multi-Channel Cognitive Radio Networks 1 Rk-aware Cooperatve Spectrum Acce for Mult-Channel Cogntve Rado Network Nng Zhang, Student Member, IEEE, Nan Cheng, Student Member, IEEE, Nng Lu, Student Member, IEEE, Habo Zhou, Student Member, IEEE,

More information

Harmonic oscillator approximation

Harmonic oscillator approximation armonc ocllator approxmaton armonc ocllator approxmaton Euaton to be olved We are fndng a mnmum of the functon under the retrcton where W P, P,..., P, Q, Q,..., Q P, P,..., P, Q, Q,..., Q lnwgner functon

More information

Chapter 11. Supplemental Text Material. The method of steepest ascent can be derived as follows. Suppose that we have fit a firstorder

Chapter 11. Supplemental Text Material. The method of steepest ascent can be derived as follows. Suppose that we have fit a firstorder S-. The Method of Steepet cent Chapter. Supplemental Text Materal The method of teepet acent can be derved a follow. Suppoe that we have ft a frtorder model y = β + β x and we wh to ue th model to determne

More information

Performance Bounds for P2P Streaming System with Transcoding

Performance Bounds for P2P Streaming System with Transcoding Appl. Math. Inf. Sc. 7, No. 6, 2477-2484 2013 2477 Appled Mathematc & Informaton Scence An Internatonal Journal http://dx.do.org/10.12785/am/070641 Performance Bound for P2P Streamng Sytem wth Trancodng

More information

Estimation of Finite Population Total under PPS Sampling in Presence of Extra Auxiliary Information

Estimation of Finite Population Total under PPS Sampling in Presence of Extra Auxiliary Information Internatonal Journal of Stattc and Analy. ISSN 2248-9959 Volume 6, Number 1 (2016), pp. 9-16 Reearch Inda Publcaton http://www.rpublcaton.com Etmaton of Fnte Populaton Total under PPS Samplng n Preence

More information

Team. Outline. Statistics and Art: Sampling, Response Error, Mixed Models, Missing Data, and Inference

Team. Outline. Statistics and Art: Sampling, Response Error, Mixed Models, Missing Data, and Inference Team Stattc and Art: Samplng, Repone Error, Mxed Model, Mng Data, and nference Ed Stanek Unverty of Maachuett- Amhert, USA 9/5/8 9/5/8 Outlne. Example: Doe-repone Model n Toxcology. ow to Predct Realzed

More information

arxiv: v1 [cs.gt] 15 Jan 2019

arxiv: v1 [cs.gt] 15 Jan 2019 Model and algorthm for tme-content rk-aware Markov game Wenje Huang, Pham Vet Ha and Wllam B. Hakell January 16, 2019 arxv:1901.04882v1 [c.gt] 15 Jan 2019 Abtract In th paper, we propoe a model for non-cooperatve

More information

This appendix presents the derivations and proofs omitted from the main text.

This appendix presents the derivations and proofs omitted from the main text. Onlne Appendx A Appendx: Omtted Dervaton and Proof Th appendx preent the dervaton and proof omtted from the man text A Omtted dervaton n Secton Mot of the analy provded n the man text Here, we formally

More information

Scattering of two identical particles in the center-of. of-mass frame. (b)

Scattering of two identical particles in the center-of. of-mass frame. (b) Lecture # November 5 Scatterng of two dentcal partcle Relatvtc Quantum Mechanc: The Klen-Gordon equaton Interpretaton of the Klen-Gordon equaton The Drac equaton Drac repreentaton for the matrce α and

More information

Batch RL Via Least Squares Policy Iteration

Batch RL Via Least Squares Policy Iteration Batch RL Va Leat Square Polcy Iteraton Alan Fern * Baed n part on lde by Ronald Parr Overvew Motvaton LSPI Dervaton from LSTD Expermental reult Onlne veru Batch RL Onlne RL: ntegrate data collecton and

More information

Chapter 6 The Effect of the GPS Systematic Errors on Deformation Parameters

Chapter 6 The Effect of the GPS Systematic Errors on Deformation Parameters Chapter 6 The Effect of the GPS Sytematc Error on Deformaton Parameter 6.. General Beutler et al., (988) dd the frt comprehenve tudy on the GPS ytematc error. Baed on a geometrc approach and aumng a unform

More information

Resource Allocation with a Budget Constraint for Computing Independent Tasks in the Cloud

Resource Allocation with a Budget Constraint for Computing Independent Tasks in the Cloud Resource Allocaton wth a Budget Constrant for Computng Independent Tasks n the Cloud Wemng Sh and Bo Hong School of Electrcal and Computer Engneerng Georga Insttute of Technology, USA 2nd IEEE Internatonal

More information

Improvements on Waring s Problem

Improvements on Waring s Problem Improvement on Warng Problem L An-Png Bejng, PR Chna apl@nacom Abtract By a new recurve algorthm for the auxlary equaton, n th paper, we wll gve ome mprovement for Warng problem Keyword: Warng Problem,

More information

The Essential Dynamics Algorithm: Essential Results

The Essential Dynamics Algorithm: Essential Results @ MIT maachuett nttute of technology artfcal ntellgence laboratory The Eental Dynamc Algorthm: Eental Reult Martn C. Martn AI Memo 003-014 May 003 003 maachuett nttute of technology, cambrdge, ma 0139

More information

Start Point and Trajectory Analysis for the Minimal Time System Design Algorithm

Start Point and Trajectory Analysis for the Minimal Time System Design Algorithm Start Pont and Trajectory Analy for the Mnmal Tme Sytem Degn Algorthm ALEXANDER ZEMLIAK, PEDRO MIRANDA Department of Phyc and Mathematc Puebla Autonomou Unverty Av San Claudo /n, Puebla, 757 MEXICO Abtract:

More information

Deep Reinforcement Learning with Experience Replay Based on SARSA

Deep Reinforcement Learning with Experience Replay Based on SARSA Deep Renforcement Learnng wth Experence Replay Baed on SARSA Dongbn Zhao, Hatao Wang, Kun Shao and Yuanheng Zhu Key Laboratory of Management and Control for Complex Sytem Inttute of Automaton Chnee Academy

More information

ECE559VV Project Report

ECE559VV Project Report ECE559VV Project Report (Supplementary Notes Loc Xuan Bu I. MAX SUM-RATE SCHEDULING: THE UPLINK CASE We have seen (n the presentaton that, for downlnk (broadcast channels, the strategy maxmzng the sum-rate

More information

Solution Methods for Time-indexed MIP Models for Chemical Production Scheduling

Solution Methods for Time-indexed MIP Models for Chemical Production Scheduling Ian Davd Lockhart Bogle and Mchael Farweather (Edtor), Proceedng of the 22nd European Sympoum on Computer Aded Proce Engneerng, 17-2 June 212, London. 212 Elever B.V. All rght reerved. Soluton Method for

More information

m = 4 n = 9 W 1 N 1 x 1 R D 4 s x i

m = 4 n = 9 W 1 N 1 x 1 R D 4 s x i GREEDY WIRE-SIZING IS LINEAR TIME Chr C. N. Chu D. F. Wong cnchu@c.utexa.edu wong@c.utexa.edu Department of Computer Scence, Unverty of Texa at Autn, Autn, T 787. ABSTRACT In nterconnect optmzaton by wre-zng,

More information

Alpha Risk of Taguchi Method with L 18 Array for NTB Type QCH by Simulation

Alpha Risk of Taguchi Method with L 18 Array for NTB Type QCH by Simulation Proceedng of the World Congre on Engneerng 00 Vol II WCE 00, July -, 00, London, U.K. Alpha Rk of Taguch Method wth L Array for NTB Type QCH by Smulaton A. Al-Refae and M.H. L Abtract Taguch method a wdely

More information

Small signal analysis

Small signal analysis Small gnal analy. ntroducton Let u conder the crcut hown n Fg., where the nonlnear retor decrbed by the equaton g v havng graphcal repreentaton hown n Fg.. ( G (t G v(t v Fg. Fg. a D current ource wherea

More information

A Kernel Particle Filter Algorithm for Joint Tracking and Classification

A Kernel Particle Filter Algorithm for Joint Tracking and Classification A Kernel Partcle Flter Algorthm for Jont Tracng and Clafcaton Yunfe Guo Donglang Peng Inttute of Informaton and Control Automaton School Hangzhou Danz Unverty Hangzhou Chna gyf@hdueducn Huaje Chen Ane

More information

Method Of Fundamental Solutions For Modeling Electromagnetic Wave Scattering Problems

Method Of Fundamental Solutions For Modeling Electromagnetic Wave Scattering Problems Internatonal Workhop on MehFree Method 003 1 Method Of Fundamental Soluton For Modelng lectromagnetc Wave Scatterng Problem Der-Lang Young (1) and Jhh-We Ruan (1) Abtract: In th paper we attempt to contruct

More information

A METHOD TO REPRESENT THE SEMANTIC DESCRIPTION OF A WEB SERVICE BASED ON COMPLEXITY FUNCTIONS

A METHOD TO REPRESENT THE SEMANTIC DESCRIPTION OF A WEB SERVICE BASED ON COMPLEXITY FUNCTIONS UPB Sc Bull, Sere A, Vol 77, I, 5 ISSN 3-77 A METHOD TO REPRESENT THE SEMANTIC DESCRIPTION OF A WEB SERVICE BASED ON COMPLEXITY FUNCTIONS Andre-Hora MOGOS, Adna Magda FLOREA Semantc web ervce repreent

More information

A New Virtual Indexing Method for Measuring Host Connection Degrees

A New Virtual Indexing Method for Measuring Host Connection Degrees A New Vrtual Indexng Method for Meaurng ot Connecton Degree Pnghu Wang, Xaohong Guan,, Webo Gong 3, and Don Towley 4 SKLMS Lab and MOE KLINNS Lab, X an Jaotong Unverty, X an, Chna Department of Automaton

More information

728. Mechanical and electrical elements in reduction of vibrations

728. Mechanical and electrical elements in reduction of vibrations 78. Mechancal and electrcal element n reducton of vbraton Katarzyna BIAŁAS The Slean Unverty of Technology, Faculty of Mechancal Engneerng Inttute of Engneerng Procee Automaton and Integrated Manufacturng

More information

Specification -- Assumptions of the Simple Classical Linear Regression Model (CLRM) 1. Introduction

Specification -- Assumptions of the Simple Classical Linear Regression Model (CLRM) 1. Introduction ECONOMICS 35* -- NOTE ECON 35* -- NOTE Specfcaton -- Aumpton of the Smple Clacal Lnear Regreon Model (CLRM). Introducton CLRM tand for the Clacal Lnear Regreon Model. The CLRM alo known a the tandard lnear

More information

Confidence intervals for the difference and the ratio of Lognormal means with bounded parameters

Confidence intervals for the difference and the ratio of Lognormal means with bounded parameters Songklanakarn J. Sc. Technol. 37 () 3-40 Mar.-Apr. 05 http://www.jt.pu.ac.th Orgnal Artcle Confdence nterval for the dfference and the rato of Lognormal mean wth bounded parameter Sa-aat Nwtpong* Department

More information

The Second Anti-Mathima on Game Theory

The Second Anti-Mathima on Game Theory The Second Ant-Mathma on Game Theory Ath. Kehagas December 1 2006 1 Introducton In ths note we wll examne the noton of game equlbrum for three types of games 1. 2-player 2-acton zero-sum games 2. 2-player

More information

OPTIMAL COMPUTING BUDGET ALLOCATION FOR MULTI-OBJECTIVE SIMULATION MODELS. David Goldsman

OPTIMAL COMPUTING BUDGET ALLOCATION FOR MULTI-OBJECTIVE SIMULATION MODELS. David Goldsman Proceedng of the 004 Wnter Smulaton Conference R.G. Ingall, M. D. Roett, J. S. Smth, and B. A. Peter, ed. OPTIMAL COMPUTING BUDGET ALLOCATION FOR MULTI-OBJECTIVE SIMULATION MODELS Loo Hay Lee Ek Peng Chew

More information

APPLICATIONS OF RELIABILITY ANALYSIS TO POWER ELECTRONICS SYSTEMS

APPLICATIONS OF RELIABILITY ANALYSIS TO POWER ELECTRONICS SYSTEMS APPLICATIONS OF RELIABILITY ANALYSIS TO POWER ELECTRONICS SYSTEMS Chanan Sngh, Fellow IEEE Praad Enjet, Fellow IEEE Department o Electrcal Engneerng Texa A&M Unverty College Staton, Texa USA Joydeep Mtra,

More information

DEADLOCK INDEX ANALYSIS OF MULTI-LEVEL QUEUE SCHEDULING IN OPERATING SYSTEM USING DATA MODEL APPROACH

DEADLOCK INDEX ANALYSIS OF MULTI-LEVEL QUEUE SCHEDULING IN OPERATING SYSTEM USING DATA MODEL APPROACH GESJ: Computer Scence and Telecommuncaton 2 No.(29 ISSN 2-232 DEADLOCK INDEX ANALYSIS OF MULTI-LEVEL QUEUE SCHEDULING IN OPERATING SYSTEM USING DATA MODEL APPROACH D. Shukla, Shweta Ojha 2 Deptt. of Mathematc

More information

Variable Structure Control ~ Basics

Variable Structure Control ~ Basics Varable Structure Control ~ Bac Harry G. Kwatny Department of Mechancal Engneerng & Mechanc Drexel Unverty Outlne A prelmnary example VS ytem, ldng mode, reachng Bac of dcontnuou ytem Example: underea

More information

Hybrid State Feedback Controller Design of Networked Switched Control Systems with Packet Dropout

Hybrid State Feedback Controller Design of Networked Switched Control Systems with Packet Dropout Amercan Control Conference Marrott Waterfront, Baltmore, MD, USA June 3-July, WeB6.6 Hybrd State Feedbac Controller Degn of etwored Swtched Control Sytem wth Pacet Dropout Dan Ma, Georg M. Dmrov and Jun

More information

The Price of Anarchy in a Network Pricing Game

The Price of Anarchy in a Network Pricing Game The Prce of Anarchy n a Network Prcng Game John Muaccho and Shuang Wu Abtract We analyze a game theoretc model of competng network ervce provder that trategcally prce ther ervce n the preence of elatc

More information

A Novel Approach for Testing Stability of 1-D Recursive Digital Filters Based on Lagrange Multipliers

A Novel Approach for Testing Stability of 1-D Recursive Digital Filters Based on Lagrange Multipliers Amercan Journal of Appled Scence 5 (5: 49-495, 8 ISSN 546-939 8 Scence Publcaton A Novel Approach for Tetng Stablty of -D Recurve Dgtal Flter Baed on Lagrange ultpler KRSanth, NGangatharan and Ponnavakko

More information

Synchronization Protocols. Task Allocation Bin-Packing Heuristics: First-Fit Subtasks assigned in arbitrary order To allocate a new subtask T i,j

Synchronization Protocols. Task Allocation Bin-Packing Heuristics: First-Fit Subtasks assigned in arbitrary order To allocate a new subtask T i,j End-to-End Schedulng Framework 1. Tak allocaton: bnd tak to proceor 2. Synchronzaton protocol: enforce precedence contrant 3. Subdeadlne agnment 4. Schedulablty analy Tak Allocaton Bn-Packng eurtc: Frt-Ft

More information

Seismic Reliability Analysis and Topology Optimization of Lifeline Networks

Seismic Reliability Analysis and Topology Optimization of Lifeline Networks The 4 th World Conference on Earthquake Engneerng October 2-7, 2008, Beng, Chna Semc Relablty Analy and Topology Optmzaton of Lfelne Network ABSTRACT: Je L and We Lu 2 Profeor, Dept. of Buldng Engneerng,

More information

Image Registration for a Series of Chest Radiograph Images

Image Registration for a Series of Chest Radiograph Images Proceedng of the 5th WE Internatonal Conference on gnal Proceng, Itanbul, Turkey, May 7-9, 006 (pp179-184) Image Regtraton for a ere of Chet Radograph Image Omar Mohd. Rjal*, Norlza Mohd. Noor, hee Lee

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

STOCHASTIC BEHAVIOUR OF COMMUNICATION SUBSYSTEM OF COMMUNICATION SATELLITE

STOCHASTIC BEHAVIOUR OF COMMUNICATION SUBSYSTEM OF COMMUNICATION SATELLITE IJS 4 () July Sharma & al ehavour of Subytem of ommuncaton Satellte SOHSI HVIOU O OMMUNIION SUSYSM O OMMUNIION SLLI SK Mttal eepankar Sharma & Neelam Sharma 3 S he author n th paper have dcued the tochatc

More information

A Study of Quantum Strategies for Newcomb s Paradox

A Study of Quantum Strategies for Newcomb s Paradox Bune, 00, : 4-50 do:0.436/b.00.004 Publhed Onlne March 00 (http://www.scrp.org/journal/b A Study of Quantum Stratege for ewcomb Paradox Taah Mhara Department of Informaton Scence and Art, Toyo Unverty,

More information

bounds compared to SB and SBB bounds as the former two have an index parameter, while the latter two

bounds compared to SB and SBB bounds as the former two have an index parameter, while the latter two 1 Queung Procee n GPS and PGPS wth LRD Traffc Input Xang Yu, Ian L-Jn Thng, Yumng Jang and Chunmng Qao Department of Computer Scence and Engneerng State Unverty of New York at Buffalo Department of Electrcal

More information

MULTIPLE REGRESSION ANALYSIS For the Case of Two Regressors

MULTIPLE REGRESSION ANALYSIS For the Case of Two Regressors MULTIPLE REGRESSION ANALYSIS For the Cae of Two Regreor In the followng note, leat-quare etmaton developed for multple regreon problem wth two eplanator varable, here called regreor (uch a n the Fat Food

More information

A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS. Dr. Derald E. Wentzien, Wesley College, (302) ,

A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS. Dr. Derald E. Wentzien, Wesley College, (302) , A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS Dr. Derald E. Wentzen, Wesley College, (302) 736-2574, wentzde@wesley.edu ABSTRACT A lnear programmng model s developed and used to compare

More information

Optimal inference of sameness Supporting information

Optimal inference of sameness Supporting information Optmal nference of amene Supportng nformaton Content Decon rule of the optmal oberver.... Unequal relablte.... Equal relablte... 5 Repone probablte of the optmal oberver... 6. Equal relablte... 6. Unequal

More information

Experience with Automatic Generation Control (AGC) Dynamic Simulation in PSS E

Experience with Automatic Generation Control (AGC) Dynamic Simulation in PSS E Semens Industry, Inc. Power Technology Issue 113 Experence wth Automatc Generaton Control (AGC) Dynamc Smulaton n PSS E Lu Wang, Ph.D. Staff Software Engneer lu_wang@semens.com Dngguo Chen, Ph.D. Staff

More information

Hongyi Miao, College of Science, Nanjing Forestry University, Nanjing ,China. (Received 20 June 2013, accepted 11 March 2014) I)ϕ (k)

Hongyi Miao, College of Science, Nanjing Forestry University, Nanjing ,China. (Received 20 June 2013, accepted 11 March 2014) I)ϕ (k) ISSN 1749-3889 (prnt), 1749-3897 (onlne) Internatonal Journal of Nonlnear Scence Vol.17(2014) No.2,pp.188-192 Modfed Block Jacob-Davdson Method for Solvng Large Sparse Egenproblems Hongy Mao, College of

More information

CHAPTER 9 LINEAR MOMENTUM, IMPULSE AND COLLISIONS

CHAPTER 9 LINEAR MOMENTUM, IMPULSE AND COLLISIONS CHAPTER 9 LINEAR MOMENTUM, IMPULSE AND COLLISIONS 103 Phy 1 9.1 Lnear Momentum The prncple o energy conervaton can be ued to olve problem that are harder to olve jut ung Newton law. It ued to decrbe moton

More information

Discrete Simultaneous Perturbation Stochastic Approximation on Loss Function with Noisy Measurements

Discrete Simultaneous Perturbation Stochastic Approximation on Loss Function with Noisy Measurements 0 Amercan Control Conference on O'Farrell Street San Francco CA USA June 9 - July 0 0 Dcrete Smultaneou Perturbaton Stochatc Approxmaton on Lo Functon wth Noy Meaurement Q Wang and Jame C Spall Abtract

More information

An Admission Control Algorithm in Cloud Computing Systems

An Admission Control Algorithm in Cloud Computing Systems An Admsson Control Algorthm n Cloud Computng Systems Authors: Frank Yeong-Sung Ln Department of Informaton Management Natonal Tawan Unversty Tape, Tawan, R.O.C. ysln@m.ntu.edu.tw Yngje Lan Management Scence

More information

Batch Reinforcement Learning

Batch Reinforcement Learning Batch Renforcement Learnng Alan Fern * Baed n part on lde by Ronald Parr Overvew What batch renforcement learnng? Leat Square Polcy Iteraton Ftted Q-teraton Batch DQN Onlne veru Batch RL Onlne RL: ntegrate

More information

A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS

A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS HCMC Unversty of Pedagogy Thong Nguyen Huu et al. A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS Thong Nguyen Huu and Hao Tran Van Department of mathematcs-nformaton,

More information

Dynamic Programming. Lecture 13 (5/31/2017)

Dynamic Programming. Lecture 13 (5/31/2017) Dynamc Programmng Lecture 13 (5/31/2017) - A Forest Thnnng Example - Projected yeld (m3/ha) at age 20 as functon of acton taken at age 10 Age 10 Begnnng Volume Resdual Ten-year Volume volume thnned volume

More information

Statistical Properties of the OLS Coefficient Estimators. 1. Introduction

Statistical Properties of the OLS Coefficient Estimators. 1. Introduction ECOOMICS 35* -- OTE 4 ECO 35* -- OTE 4 Stattcal Properte of the OLS Coeffcent Etmator Introducton We derved n ote the OLS (Ordnary Leat Square etmator ˆβ j (j, of the regreon coeffcent βj (j, n the mple

More information

Modeling of Wave Behavior of Substrate Noise Coupling for Mixed-Signal IC Design

Modeling of Wave Behavior of Substrate Noise Coupling for Mixed-Signal IC Design Modelng of Wave Behavor of Subtrate Noe Couplng for Mxed-Sgnal IC Degn Georgo Veron, Y-Chang Lu, and Robert W. Dutton Center for Integrated Sytem, Stanford Unverty, Stanford, CA 9435 yorgo@gloworm.tanford.edu

More information

Joint Energy-Efficient Cooperative Spectrum Sensing and Power Allocation in Cognitive Machine-to-Machine Communications

Joint Energy-Efficient Cooperative Spectrum Sensing and Power Allocation in Cognitive Machine-to-Machine Communications Jont Energy-Effcent Cooperatve Spectrum Senng and Power Allocaton n Cogntve Machne-to-Machne Communcaton Ha Ngoc Pham, Yan Zhang, Tor Skee, Paal E. Engeltad, Frank Elaen Department of Informatc, Unverty

More information

Distributed Control for the Parallel DC Linked Modular Shunt Active Power Filters under Distorted Utility Voltage Condition

Distributed Control for the Parallel DC Linked Modular Shunt Active Power Filters under Distorted Utility Voltage Condition Dtrbted Control for the Parallel DC Lnked Modlar Shnt Actve Power Flter nder Dtorted Utlty Voltage Condton Reearch Stdent: Adl Salman Spervor: Dr. Malabka Ba School of Electrcal and Electronc Engneerng

More information

Foresighted Demand Side Management

Foresighted Demand Side Management Foresghted Demand Sde Management 1 Yuanzhang Xao and Mhaela van der Schaar, Fellow, IEEE Department of Electrcal Engneerng, UCLA. {yxao,mhaela}@ee.ucla.edu. Abstract arxv:1401.2185v1 [cs.ma] 9 Jan 2014

More information

Information Acquisition in Global Games of Regime Change (Online Appendix)

Information Acquisition in Global Games of Regime Change (Online Appendix) Informaton Acquton n Global Game of Regme Change (Onlne Appendx) Mchal Szkup and Iabel Trevno Augut 4, 05 Introducton Th appendx contan the proof of all the ntermedate reult that have been omtted from

More information

A New Inverse Reliability Analysis Method Using MPP-Based Dimension Reduction Method (DRM)

A New Inverse Reliability Analysis Method Using MPP-Based Dimension Reduction Method (DRM) roceedng of the ASME 007 Internatonal Degn Engneerng Techncal Conference & Computer and Informaton n Engneerng Conference IDETC/CIE 007 September 4-7, 007, La Vega, eada, USA DETC007-35098 A ew Inere Relablty

More information

The multivariate Gaussian probability density function for random vector X (X 1,,X ) T. diagonal term of, denoted

The multivariate Gaussian probability density function for random vector X (X 1,,X ) T. diagonal term of, denoted Appendx Proof of heorem he multvarate Gauan probablty denty functon for random vector X (X,,X ) px exp / / x x mean and varance equal to the th dagonal term of, denoted he margnal dtrbuton of X Gauan wth

More information

A Hybrid Evolution Algorithm with Application Based on Chaos Genetic Algorithm and Particle Swarm Optimization

A Hybrid Evolution Algorithm with Application Based on Chaos Genetic Algorithm and Particle Swarm Optimization Natonal Conference on Informaton Technology and Computer Scence (CITCS ) A Hybrd Evoluton Algorthm wth Applcaton Baed on Chao Genetc Algorthm and Partcle Swarm Optmzaton Fu Yu School of Computer & Informaton

More information

Dynamic Pricing for Smart Grid with Reinforcement Learning

Dynamic Pricing for Smart Grid with Reinforcement Learning Dynamc Prcng for Smart Grd wth Renforcement Learnng Byung-Gook Km, Yu Zhang, Mhaela van der Schaar, and Jang-Won Lee Samsung Electroncs, Suwon, Korea Department of Electrcal Engneerng, UCLA, Los Angeles,

More information

Estimation of a proportion under a certain two-stage sampling design

Estimation of a proportion under a certain two-stage sampling design Etmaton of a roorton under a certan two-tage amng degn Danutė Kraavcatė nttute of athematc and nformatc Lthuana Stattc Lthuana Lthuana e-ma: raav@tmt Abtract The am of th aer to demontrate wth exame that

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

EEL 6266 Power System Operation and Control. Chapter 3 Economic Dispatch Using Dynamic Programming

EEL 6266 Power System Operation and Control. Chapter 3 Economic Dispatch Using Dynamic Programming EEL 6266 Power System Operaton and Control Chapter 3 Economc Dspatch Usng Dynamc Programmng Pecewse Lnear Cost Functons Common practce many utltes prefer to represent ther generator cost functons as sngle-

More information

Dynamic Chinese Restaurant Game in Cognitive Radio Networks

Dynamic Chinese Restaurant Game in Cognitive Radio Networks 2013 Proceedngs IEEE INFOCO Dynamc Chnese Restaurant Game n Cogntve Rado Networks Chunxao Jang, Yan Chen, Yu-Han Yang, Chh-Yu Wang, and K. J. Ray Lu Department of Electrcal and Computer Engneerng, Unversty

More information

Extended Prigogine Theorem: Method for Universal Characterization of Complex System Evolution

Extended Prigogine Theorem: Method for Universal Characterization of Complex System Evolution Extended Prgogne Theorem: Method for Unveral Characterzaton of Complex Sytem Evoluton Sergey amenhchkov* Mocow State Unverty of M.V. Lomonoov, Phycal department, Rua, Mocow, Lennke Gory, 1/, 119991 Publhed

More information

Problem Set 9 Solutions

Problem Set 9 Solutions Desgn and Analyss of Algorthms May 4, 2015 Massachusetts Insttute of Technology 6.046J/18.410J Profs. Erk Demane, Srn Devadas, and Nancy Lynch Problem Set 9 Solutons Problem Set 9 Solutons Ths problem

More information

Joint Source Coding and Higher-Dimension Modulation

Joint Source Coding and Higher-Dimension Modulation Jont Codng and Hgher-Dmenon Modulaton Tze C. Wong and Huck M. Kwon Electrcal Engneerng and Computer Scence Wchta State Unvert, Wchta, Kana 676, USA {tcwong; huck.kwon}@wchta.edu Abtract Th paper propoe

More information

Lecture Space-Bounded Derandomization

Lecture Space-Bounded Derandomization Notes on Complexty Theory Last updated: October, 2008 Jonathan Katz Lecture Space-Bounded Derandomzaton 1 Space-Bounded Derandomzaton We now dscuss derandomzaton of space-bounded algorthms. Here non-trval

More information

Appendix B: Resampling Algorithms

Appendix B: Resampling Algorithms 407 Appendx B: Resamplng Algorthms A common problem of all partcle flters s the degeneracy of weghts, whch conssts of the unbounded ncrease of the varance of the mportance weghts ω [ ] of the partcles

More information

Root Locus Techniques

Root Locus Techniques Root Locu Technque ELEC 32 Cloed-Loop Control The control nput u t ynthezed baed on the a pror knowledge of the ytem plant, the reference nput r t, and the error gnal, e t The control ytem meaure the output,

More information

CHAPTER IV RESEARCH FINDING AND ANALYSIS

CHAPTER IV RESEARCH FINDING AND ANALYSIS CHAPTER IV REEARCH FINDING AND ANALYI A. Descrpton of Research Fndngs To fnd out the dfference between the students who were taught by usng Mme Game and the students who were not taught by usng Mme Game

More information

Games of Threats. Elon Kohlberg Abraham Neyman. Working Paper

Games of Threats. Elon Kohlberg Abraham Neyman. Working Paper Games of Threats Elon Kohlberg Abraham Neyman Workng Paper 18-023 Games of Threats Elon Kohlberg Harvard Busness School Abraham Neyman The Hebrew Unversty of Jerusalem Workng Paper 18-023 Copyrght 2017

More information

Information Weighted Consensus

Information Weighted Consensus Informaton Weghted Consensus A. T. Kamal, J. A. Farrell and A. K. Roy-Chowdhury Unversty of Calforna, Rversde, CA-92521 Abstract Consensus-based dstrbuted estmaton schemes are becomng ncreasngly popular

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Two Approaches to Proving. Goldbach s Conjecture

Two Approaches to Proving. Goldbach s Conjecture Two Approache to Provng Goldbach Conecture By Bernard Farley Adved By Charle Parry May 3 rd 5 A Bref Introducton to Goldbach Conecture In 74 Goldbach made h mot famou contrbuton n mathematc wth the conecture

More information

Improvements on Waring s Problem

Improvements on Waring s Problem Imrovement on Warng Problem L An-Png Bejng 85, PR Chna al@nacom Abtract By a new recurve algorthm for the auxlary equaton, n th aer, we wll gve ome mrovement for Warng roblem Keyword: Warng Problem, Hardy-Lttlewood

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Parameter Estimation for Dynamic System using Unscented Kalman filter

Parameter Estimation for Dynamic System using Unscented Kalman filter Parameter Estmaton for Dynamc System usng Unscented Kalman flter Jhoon Seung 1,a, Amr Atya F. 2,b, Alexander G.Parlos 3,c, and Klto Chong 1,4,d* 1 Dvson of Electroncs Engneerng, Chonbuk Natonal Unversty,

More information

Two-Layered Model of Blood Flow through Composite Stenosed Artery

Two-Layered Model of Blood Flow through Composite Stenosed Artery Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 93-9466 Vol. 4, Iue (December 9), pp. 343 354 (Prevouly, Vol. 4, No.) Applcaton Appled Mathematc: An Internatonal Journal (AAM) Two-ayered Model

More information

Computational issues surrounding the management of an ecological food web

Computational issues surrounding the management of an ecological food web Computatonal ssues surroundng the management of an ecologcal food web Wllam J M Probert, Eve McDonald-Madden, Nathale Peyrard, Régs Sabbadn AIGM 12, ECAI2012 Montpeller, France Ratonale Ecology has many

More information

A Complexity-Based Approach in Image Compression using Neural Networks

A Complexity-Based Approach in Image Compression using Neural Networks Internatonal Journal of Sgnal Proceng 5; www.waet.org Sprng 009 A Complexty-Baed Approach n Image Compreon ung eural etwork Had Ve, Manour Jamzad Abtract In th paper we preent an adaptve method for mage

More information

Pricing and Resource Allocation Game Theoretic Models

Pricing and Resource Allocation Game Theoretic Models Prcng and Resource Allocaton Game Theoretc Models Zhy Huang Changbn Lu Q Zhang Computer and Informaton Scence December 8, 2009 Z. Huang, C. Lu, and Q. Zhang (CIS) Game Theoretc Models December 8, 2009

More information

Spectral Properties of the Grounded Laplacian Matrix with Applications to Consensus in the Presence of Stubborn Agents

Spectral Properties of the Grounded Laplacian Matrix with Applications to Consensus in the Presence of Stubborn Agents Spectral Properte of the Grounded Laplacan Matrx wth Applcaton to Conenu n the Preence of Stubborn Agent Mohammad Pran and Shreya Sundaram Abtract We tudy lnear conenu and opnon dynamc n network that contan

More information

Economics 101. Lecture 4 - Equilibrium and Efficiency

Economics 101. Lecture 4 - Equilibrium and Efficiency Economcs 0 Lecture 4 - Equlbrum and Effcency Intro As dscussed n the prevous lecture, we wll now move from an envronment where we looed at consumers mang decsons n solaton to analyzng economes full of

More information

Introduction. Modeling Data. Approach. Quality of Fit. Likelihood. Probabilistic Approach

Introduction. Modeling Data. Approach. Quality of Fit. Likelihood. Probabilistic Approach Introducton Modelng Data Gven a et of obervaton, we wh to ft a mathematcal model Model deend on adutable arameter traght lne: m + c n Polnomal: a + a + a + L+ a n Choce of model deend uon roblem Aroach

More information

2. SINGLE VS. MULTI POLARIZATION SAR DATA

2. SINGLE VS. MULTI POLARIZATION SAR DATA . SINGLE VS. MULTI POLARIZATION SAR DATA.1 Scatterng Coeffcent v. Scatterng Matrx In the prevou chapter of th document, we dealt wth the decrpton and the characterzaton of electromagnetc wave. A t wa hown,

More information

Low Complexity Soft-Input Soft-Output Hamming Decoder

Low Complexity Soft-Input Soft-Output Hamming Decoder Low Complexty Soft-Input Soft-Output Hammng Der Benjamn Müller, Martn Holters, Udo Zölzer Helmut Schmdt Unversty Unversty of the Federal Armed Forces Department of Sgnal Processng and Communcatons Holstenhofweg

More information

Portioned Static-Priority Scheduling on Multiprocessors

Portioned Static-Priority Scheduling on Multiprocessors Portoned Statc-Prorty Schedulng on Multproceor Shnpe Kato and Nobuyuk Yamaak School of Scence for Open and Envronmental Sytem Keo Unverty, Japan {hnpe,yamaak}@ny.c.keo.ac.jp Abtract Th paper propoe an

More information

Discrete MRF Inference of Marginal Densities for Non-uniformly Discretized Variable Space

Discrete MRF Inference of Marginal Densities for Non-uniformly Discretized Variable Space 2013 IEEE Conference on Computer Von and Pattern Recognton Dcrete MRF Inference of Margnal Dente for Non-unformly Dcretzed Varable Space Maak Sato Takayuk Okatan Kochro Deguch Tohoku Unverty, Japan {mato,

More information

Pythagorean triples. Leen Noordzij.

Pythagorean triples. Leen Noordzij. Pythagorean trple. Leen Noordz Dr.l.noordz@leennoordz.nl www.leennoordz.me Content A Roadmap for generatng Pythagorean Trple.... Pythagorean Trple.... 3 Dcuon Concluon.... 5 A Roadmap for generatng Pythagorean

More information

A Weighted UTASTAR Method for the Multiple Criteria Decision Making with Interval Numbers

A Weighted UTASTAR Method for the Multiple Criteria Decision Making with Interval Numbers 3rd Internatonal Conference on Management Scence and Management Innovaton MSMI 2016) A Weghted UTASTAR Method for the Multple Crtera Decon Makng wth Interval Number Wen-Tao Xong Jng Cheng School of Mathematc

More information

A Simple Inventory System

A Simple Inventory System A Smple Inventory System Lawrence M. Leems and Stephen K. Park, Dscrete-Event Smulaton: A Frst Course, Prentce Hall, 2006 Hu Chen Computer Scence Vrgna State Unversty Petersburg, Vrgna February 8, 2017

More information

Predictors Using Partially Conditional 2 Stage Response Error Ed Stanek

Predictors Using Partially Conditional 2 Stage Response Error Ed Stanek Predctor ng Partally Condtonal Stage Repone Error Ed Stane TRODCTO We explore the predctor that wll relt n a mple random ample wth repone error when a dfferent model potlated The model we decrbe here cloely

More information

Grover s Algorithm + Quantum Zeno Effect + Vaidman

Grover s Algorithm + Quantum Zeno Effect + Vaidman Grover s Algorthm + Quantum Zeno Effect + Vadman CS 294-2 Bomb 10/12/04 Fall 2004 Lecture 11 Grover s algorthm Recall that Grover s algorthm for searchng over a space of sze wors as follows: consder the

More information

Tracking with Kalman Filter

Tracking with Kalman Filter Trackng wth Kalman Flter Scott T. Acton Vrgna Image and Vdeo Analyss (VIVA), Charles L. Brown Department of Electrcal and Computer Engneerng Department of Bomedcal Engneerng Unversty of Vrgna, Charlottesvlle,

More information