A Modular On-line Profit Sharing Approach in Multiagent Domains
|
|
- Dorcas Johnson
- 5 years ago
- Views:
Transcription
1 A Modular O-le Prof Sharg Approach Mulage Domas Pucheg Zhou, ad Bgrog Hog Absrac How o coordae he behavors of he ages hrough learg s a challegg problem wh mul-age domas. Because of s complexy, rece work has focused o how coordaed sraeges ca be leared. Here we are eresed usg reforceme learg echques o lear he coordaed acos of a group of age whou requrg explc commucao amog hem. However, radoal reforceme learg mehods are based o he assumpo ha he evrome ca be modeled as Markov Decso Proces whch usually cao be sasfed whe mulple ages coexs he same evrome. Moreover, o effecvely coordae each age s behavor so as o acheve he goal, s ecessary o augme he sae of each age wh he formao abou oher exsg ages. Wherea as he umber of ages a mulage evrome crease he sae space of each age grows expoeally, whch wll cause he combaoal exploso problem. Prof sharg s oe of he reforceme learg mehods ha allow ages o lear effecve behavors from her expereces eve wh o-markova evromes. I hs paper, o remedy he drawback of he orgal prof sharg approach ha eeds much memory o sore each sae-aco par durg he learg proces we frsly address a kd of o-le raoal prof sharg algorhm. he, we egrae he advaages of modular learg archecure wh o-le raoal prof sharg algorhm, ad propose a ew modular reforceme learg model. he effecveess of he echque s demosraed usg he pursu problem. Keywords Mul-age learg; reforceme learg; raoal prof sharg; modular archecure. I. INRODUCION mulage sysem (MAS) whch here s a umber of Aauoomous ages eracg, wh each affecg he acos of he ohers esseally cosues a complex sysem. Performg ad compleg asks such a evrome ca be exremely dffcul. I hs paper, he cooperave MAS s cocered, whch several ages aemp, hrough her eraco, o joly solve asks or o maxmze her uly[]. Learg eables MAS o be more flexble ad robu ad makes hem beer able o hadle ucera ad chagg crcumsaces. hus how o coordae dffere ages Mauscrp receved Jue 26, 6. hs work was suppored par by he Naoal Scece Foudao uder Gra Pucheg Zhou s wh he School of Compuer Scece ad echology, Harb Isue of echology, Harb 5000, HeLogJag Provce, P.R.Cha(correspodg auhor o provde phoe: ; fax: ; e-mal: zhoupc@h.edu.c). Bgrog Hog s wh he School of Compuer Scece ad echology, Harb Isue of echology, Harb 5000, HeLogJag Provce, P.R.Cha (e-mal: hogbr@h.edu.c). behavors by learg so as o acheve he commo goal s a mpora heme mulage domas. (e.g., see [2, 3]) Reforceme learg (RL)[4, 5] s he problem faced by a age ha mus lear behavor hrough ral-ad-error eracos wh dyamc evrome ad has bee appled successfully may sgle age sysems. Learg from he evrome s robus because ages are drecly affeced by he dyamcs of he evrome. Because of hese characer RL has become oe of he mpora learg approaches for mulage learg. (e.g., see [6, 7]) I a mulage doma, o acheve he sharg goal, learg age eeds o augme s sae space o clude more effecve formao. A he same me, he creme of sae space wll lead o slow dow he learg process. Based o he dea of modular archecure, Oo[8] proposed a modular Q-learg model, whch ca reduce he sae space effecvely. However, Q-learg s based o he evrome whch should obey he Markova propery, ha he evrome ca by modeled by a Markov Decso Proces bu for mulage doma hese cao be me wh. he raoal prof sharg, whch s proposed by Myazak[9], s a cred assgme mechasm ha bases o rals ad errors. Learg ages erac wh he evrome hrough ral ad error, ad reforce effecve sae-aco pars ad suppress effecve sae-aco pars by learg epsode. Prof sharg coras wh oher reforceme learg approaches whch are based o Dyamc Programmg, such as emporal Dfferece mehod ad Q-learg, ha prof sharg guaraees covergece o a effecve polcy eve domas ha do o obey he Markova propery, f a ask s epsodc ad a cred s assged a approprae maer[0, ]. Based o hese researche hs paper we egrae he advaages of modular archecure wh raoal prof sharg, ad propose a ew modular reforceme learg approach. II. MODULAR REINFORCEMEN LEARNING A. Markov Decso Processes ad Reforceme Learg Markov Decso Process (MDP) s geerally regarded as he mahemacal foudao for RL. A fully observable MDP s a quadruple (S, A,, R), where S s a fe se of sae A s a se of aco : SAS [0,] s he sae raso fuco ha descrbes he probably ps ( ' sa, ) of edg up sae s whe performg aco a sae ad 424
2 R: SA s reward fuco ha reurs a umerc value afer akg aco a sae s. A age s polcy s a mappg : S A. he goal of he age s o fd a opmal polcy ha maxmzes he expeced sum of dscoued rewards V(, s ) E( r, s) () 0 where r s he scalar reward a me sep, [0,] s dscou facor. Equao () ca be rewre as V(, s ) r(, s a) p( s' a) V( s', ) (2) s' S where a s he aco deermed by polcy. Dyamc Programmg (DP) heory[2] guaraees here exss a leas a deermsc opmal polcy *, ha for ay s S, sasfes * * V(, s ) max{ r p(',) s s a V( s', )} (3) aa s' S Q-learg[3]s oe kd of model-free RL algorhm s ha based o DP. I essece Q-learg s a emporal-dfferece learg mehod. Q-learg drecly compues he approxmao of a opmal aco-value fuco, called Q-value, by usg he followg updae rule Qs (, a) Qs (, a) (4) ( ) r maxq( s, a) Q( s, a ) here [0,] s learg rae. B. Modular Q-learg a he modular Q-learg archecure[8] cosss of a medaor module ad learg modules whch amou o he umber of ages volved he ask, show as Fg. Each age he learg module carres ou Q-learg he evrome. I each learg module, learg coceraes o a sgle age ad he learg of oher ages s o cosdered. o complee he global goal, eeds a medaor module o arbrae he resuls of he learg modules. he medaor module combes learg modules decso polces usg a smple heursc decso procedure ad deermes a fal decso by he correspodg age. Reward Evrome Aco Sae Age Learg Module Medaor Module Q Q 2 Q L Learg Module 2 x x 2 x L Learg Module L Fg. he modular archecure for a learg age he learg ages ca decde her aco by usg he Greaes Mass (GM) sraegy proposed by Whehead[4], L k arg max Q ( xk, a). (7) aa k where Q k deoes he aco-value fuco susaed by he learg module k, L s he umber of learg modules. Iuvely, hs s a seleco b majory amog acos proposed by dvdual learg-modules. III. RAIONAL PROFI SHARING Prof sharg s orgally proposed by Grefesee[5]. he orgal verso used prof sharg as a cred assgme mehod based o ral-ad-error experece whou ulzg ay form of value esmao. However, hs approach does o guaraee he raoaly of a acqured polcy. o guaraee covergece o a raoal polcy a o-markova doma, Myazak[9, 6] roduces he raoaly heorem, whch he cred assgme fuco should sasfy, here we call raoal prof sharg. I a mulage evrome, a me sep he learg age observes a sae s, whch s geerally he parally avalable sae of s evrome. A aco a s he seleced from he aco se. Afer he aco s execued, he age deermes wheher a reward r s geeraed. If here s o reward, he age sores he sae-aco par so-called rule (s, a ) s epsodc memory, ad repeas hs cycle ul a reward s geeraed. he process of movg from a al sae o he fal reward sae s called a epsode. Whe a reward s gve o he age, he rules sored s epsodc memory are reforced a oce. he fuco ha maps saes o acos s called a polcy. A polcy s called raoal f ad oly f he expeced reward per a aco s larger ha zero. We call a subsequece of a epsode a deour whe dffere acos are seleced for he same sae a epsode. he rules o a deour are called effecve rules. o corol he effecve rule Myazak[6] gves he followg raoaly heorem of prof sharg. Lemma. Ay effecve rule ca be suppressed ff W, 2,..., W, C f j f (8) where C s a upper boud of he umber of coflcg effecve rule ad W s a upper boud of he legh of epsodes. I pracce, we ca le C=M-, where M s he umber of acos. f deoes a reforceme value for he rule seleced a sep before a reward s acqured. he equaly (8) s called suppresso codo. Lemma 2. Prof sharg ca lear a raoal polcy ff sasfes he suppresso codo of lemma. he algorhm for he prof sharg based o lookup able s followed. Algorhm. he prof sharg algorhm Ialze, for all ss, aa( s) : wsa (, ) a small cosa Repea (for each epsode) 0, alze s j 425
3 Repea (for each sep of epsode) Accordg o s, choose a usg weghed-roulee: a) Pr( a a s s) a' A( s) a' ) Save sae-aco par (s, a ) o epsodc memory Perform aco a ; observe reward r +, ad ex sae s + ul s + s ermal sae, for each par (s, a ) epsodc memory: ws (, a) ws (, a) f( r, ), 0,,..., ul he algorhm holds he soppg crera IV. MODULAR ON-LINE PROFI SHARING APPROACH Wh he modular-q model, each learg module oly cocers wh a par of formao of he evrome, hs may reduce he sze of sae space. However, wll easly lead o percepual alasg problem[7],.e., he learg age wrogly rea wo or more dffere saes as oe. A he same me, a mul-age doma, he succeeded sae of he learg age wll o oly be effeced by s aco ad he evrome, bu also be effeced by oher ages behavor oher word he evrome cao obey Markova propery. However, eve wh o-markova evrome, prof sharg mehod ca lear effecve polcy. So, here we adop prof sharg as a basc algorhm for each learg module. A. O-le Prof Sharg Algorhm Prof sharg algorhm show Algorhm s a off-le updag mehod, whch does o chage weghs of s sae-aco pars ul he ed of a ask. Oe drawback s ha wll requre ubouded memory space o sore all of he seleced sae-aco pars durg he ask. Based o he dea of elgbly races[5,8], here we propose a o-le prof sharg algorhm. he dea behd elgbly races s very smple. Each me a sae s vsed aes a shor-erm memory proces a race, whch he decays gradually over me. hs race marks he sae as elgble for learg. here are maly wo kds of elgbly race ha s accumulag ad replacg elgbly race as defed by equao (9) ad (0), respecvely. e (, s a) s aa e (, s a) (9) e (, s a) oherwse s a a e(, s a) 0 s a a (0) e (, s a) s s where s he decay parameer, ad 0. We apply elgbly races for mplemeg prof sharg mehod cremeally, ad propose a kd of o-le prof sharg algorhm, as show Algorhm 2. Algorhm 2. he o-le prof sharg algorhm Ialze, for all ss, aa( s) : wsa (, ) a small cosa Repea (for each epsode) for all a : esa (, ) 0 0, alze s Repea (for each sep of epsode) Accordg o s, choose a usg weghed-roulee: a) Pr( a a s s) a' A( s) a' ) Updae elgbly races: f use accumulag elgbly race he e( a) e ( a) f use replacg elgbly race he e( a) e( a) 0,for aa( s), a a ake aco a ; observe reward, r +, ad ex sae, s + For all a: wsa (, ) wsa (, ) r e(, sa) e(, s a) e(, s a) ul s + s he ermal sae ul he algorhm holds he soppg crera heorem. If he followg hold:. Algorhm adops cred assgme fuco f(, r) r where, ad. C 2. Algorhm 2 uses accumulag elgbly races. 3. he ermedae rewards are zero ul he ed of each epsode. he, Algorhm s equvale o Algorhm 2, boh of hem ca lear a raoal polcy. Proof. Fr we wll prove ha hese wo algorhms are equvale. Accordg o Kroecker Dela fuco:, j (, j) 0, j Equao (9) ca be rewre as e(, s a) e (, s a) (, s s)(, a a) (, ss0)(, aa0) (, ss)(, aa) (, ss )(, aa )) (, ss)(, aa) 0 (, ss)(, aa) I Algorhm 2, for ay sae-aco par ( a), afer oe erao he cremeal wegh s wsa (, ) r e(, sa), so a me sep whe a epsode ed he oal updae s w (, s a) r e (, s a) o 0 r s s a a 0 0 (, )(, ) 426
4 (, ) (, ) 2 (, ) (, ) 0 r s s a a r s s a a 0 r (, s s )(, a a ) Because all he ermedae rewards ul he ed of a epsode are zero, ha r r 2 r 0, so o(, ) (, )(, ) 0 w s a r s s a a () O he oher had, for Algorhm, a me sep whe a epsode s over, for ay sae-aco par, he sum of cremeal wegh ca be wre as Sce w (, s a) f(, r )(, s s )(, a a ) (2) f(, r ) off 0 r, equao (2) ca be rasformed o off (, ) (, )(, ) o(, ) 0 w s a r s s a a w s a herefore, for ay sae-aco par, he sums of all he updaes hese wo algorhms are same afer learg, so hey are equvale. Nex, we wll show ha hese wo algorhms ca lear a raoal polcy. For Algorhm, accordg o he cred assgme fuco, we wll have - C( ) C f(, r) C r r 0 0 C r Because C, C so - C C f(, r) r r f(, r). 0 herefore, sasfes he suppresso codo lemma, ad accordg o lemma 2, wll lear a raoal polcy. Because of he equvalece of Algorhm ad 2, we ca coclude ha Algorhm wll also lear a raoal polcy. heorem 2. If he followg hold: W. 2, where s he decay parameer equao (0), ad W s a upper boud of he legh of epsodes Lemma. 2. Algorhm 2 uses replacg elgbly races. 3. he ermedae rewards are zero ul he ed of each epsode. he, Algorhm 2 ca lear a raoal polcy. Proof. Fr defe a fuco 0, j f(, j) (, j), j he equao (0) ca be rewre as e ( a) f( s ) e ( a) ( s ) ( a, a ) f(, s s) f(, s s ) e(, s a) f(, s s )(, s s )(, a a ) (, s s )(, a a ) s s aa f ss ss aa 0 (, )(, ) (, ) (, )(, ) For ay sae-aco par (a), afer oe erao he cremeal wegh s wsa (, ) r e(, sa), so a me sep whe a epsode ed he oal updae s w (, s a) r e (, s a) o 0 Because all he ermedae rewards ul he ed of a epsode are zero, ha r r 2 r 0, so w (, s a) r e (, s a) o Assume a me sep - he vsed sae-aco par s (s c, a d ), he w ( s, a ) r e ( s, a ) r o c d c d A he same me, for ay oher uvsed sae-aco par a me -, whch s deoed by ( s, a ) U ( s, a ) s S, a A( s ) ( s, a ) we have u v u v u v u c d w ( s, a ) r e ( s, a ) o u v u v he for all uvsed sae-aco pars a me -, he sum of cremeal weghs s w ( s, a ) r e ( s, a ) o u v u v U U r ( s, s ) ( a, a ) f( s, s ) (3) 2 u v u 0 U r ( s, s ) ( a, a ) u v U Because a me - for ay (s u, a v ), holds ( s, s ) ( a, a ) 0 so u v r ( su, s ) ( av, a ) 0 (4) U Accordg o he defo, s appare ha so f( s, s ) u ( s, s ) ( a, a ) f( s, s ) ( s, s ) ( a, a ) u v u u v For each me sep a =0,,, -2, wh he se U here s a mos oe sae-aco par would be vsed, so ( s, s ) ( a, a ) Hece U u v ( s, s ) ( a, a ) f( s, s ) u v u U (5) Iegrae equao(4) ad equao (5) wh equao(3), we have 427
5 2 wo( su, av ) r r U 0 Because W W 2 herefore w ( s, a ) r w ( s, a ) U o u v o c d herefore, sasfes he suppresso codo lemma, accordg o lemma 2 Algorhm 2 ca lear a raoal polcy. B. he Proposed Modular Reforceme Learg Model Based o he aalyss meoed above, here we propose a modular prof sharg model, as show Fg 2. he learg model of he age cosss of hree module hey are Sae Decomposo Module (SDM), Learg Module (LM) ad Medaor Module (MM), respecvely. SDM deermes he sae of each LM, ad he each LM seds s local polcy o MM. MM wll deerme o choose a approprae aco for he learg age o ake. SDM Reward Sae Age LM ASM SAP PSM Evromes ad oher Ages LM 2 ASM SAP PSM MM Aco LM L ASM SAP PSM Fg. 2 he modular o-le prof sharg model of he age Each LM cosss of hree submodules. ASM: Aco Seleco Module, whch wll sed he local polcy o MM, cludg he avalable acos ad her weghs for he curre sae. SAP: Sae-Aco Par wegh able, whch keeps all of he sae-aco pars ad s wegh, also sores he elgbly race of each sae-aco par. PSM: Prof Sharg Module. Accordg o he receved reward r, he curre sae s ad he chose aco a of MM, PSM updaes he wegh of each sae-aco par usg o-le prof sharg algorhm. he proposed learg algorhm s show as Algorhm 3. Algorhm 3. he proposed modular o-le prof sharg algorhm. Ialze, se learg parameer for all ss, aa( s) each LM: wsa (, ) a small cosa 2. Repea (for each epsode): ) for all ss, a A( s) each LM, esa (, ) 0, 0 2) Repea (for each sep of epsode) () Observe curre sae s() (2) Deerme he sae of each LM, s ()(=,,L, L s he umber of LM) (3) Each LM seds s avalable acos aa(s ()) ad wegh s (), a) o MM (4) Based o receved polces from each LM, MM chooses aco a() usg GM sraegy: k a () arg max w ( s (), a) L aa( sk( )) k (5) Each LM updaes he elgbly race of he vsed sae-aco par. (6) ake aco a(); observe reward, r(+), ad ex sae, s(+) (7) For all of he sae-aco par each LM, updae s wegh ad elgbly race: wsa (, ) wsa (, ) r e(, sa) e(, s a) e(, s a) (8) If sae s(+) s a ermal sae, he go o sep 3); oherwse, +, go o sep (2) 3) If he algorhm holds he soppg crera, he he learg process s over; oherwse, go o sep ). A. Problem Doma V. SIMULAION o evaluae he learg approach proposed hs paper, we choose a modfed verso of he pursu problem[9] as he es bed. I a 8 8 orodal grd world, a sgle prey ad four huer ages are placed a radom posos he grd, as show Fg 3(a). A each me sep, ages sychroously execue oe ou of fve acos: sayg a he curre poso or movg orh, souh, we or eas. he prey ad he huer cao share a cell, bu more ha oe huer ca be a he same poso. Also, a age s o allowed o move off he evrome. Every age ca see objecs a a cera dsace. he dsace ad he cells covers are, respecvely, called he vsual deph ad he vsual evrome of he age. A age havg vsual deph d ca see all he cells sde he (2d+) (2d+) square aroud. hese cells are he vsual evrome of he age. Each age s assged a uque defer. A huer age ca locae he relave poso ad recogze he defer of ay oher ages s vsual evrome. he prey s capured, whe all of s eghbor cell are occuped by he huer as show Fg 3(b). he all of he ages are relocaed a ew radom posos. k 428
6 700 Average seps ul capure Modular-Q, =0. Modular-Q, =0.8 MPS-AE, =0. MPS-RE, =0.5 (a) Fg. 3 (a) Example of he pursu problem a grd world. (b) Examples of capurg saes he pursu problem has he followg characerscs. ) he evrome s fully dyamc, parally observable ad o-deermsc. 2) he huers ages are learg age whle he prey selecs s ow aco based o a radom sraegy. 3) he huer ages are self-eresed, ad here s o commucao bewee each oher. B. Expermeal Seg ad Smulao Resuls o es he proposed algorhm, we compare modular prof sharg based o accumulag elgbly race (MPS-AE), wh modular prof sharg based o replacg elgbly race (MPS-RE) ad Modular Q-learg (Modular-Q). Accordg o heorem, MPS-AE we se decay parameer 0., MPS-RE le 0.5. For Modular-Q, he learg rae 0. or 0.8, dscou facor 0.9. he oal learg epsodes are 5000, ad he maxmum eraos wh a epsode are 500 me seps. he vsual deph of he huer s d=3. Upo capurg he prey, all he huers surroudg he prey receve a reward of 00, ad accordgly all of her cosues learg modules uformly receve he same reward regardless of wha acos hey have jus proposed. I ay oher case huers receve a reward of zero. Sce he huer cao fsh capure he prey aloe, herefore, o capure he prey, he huer should ake he formao of he prey ad oher huers o accou, so as o beer coordae s behavor wh oher huers. For he problem of (N+) huers capurg oe prey, accordg o coveoal sae codg, he sae of Huer ca be deoed as s:=<p, H,, H N>, where P s he relave poso of he prey o Huer, ad H k (k=,, N) deoes he relave poso of Huer k o Huer. Whe he huer wh vsual deph d, he sze of he sae space s S =(v s +) N+. For module archecure, he sae space of each learg module for he huer s a combao of he relave posos of he prey ad oe oher huer, ha he sze of sae space s S =(v s +) 2, accordgly, he sze of oal sae space s S =N (v s +) 2, oher word he relaoshp bewee he sze of he sae space ad he umber of learg ages s rasformed from expoeal o power, whch meas grealy reduce he sze of sae space. Fg 4 s he learg curve of he hree learg algorhm uder dffere me afer several ral Fg 5 gves he hsogram of he learg resuls for hree algorhms. (b) Fg. 4 he graph of comparso of hree learg algorhms Average seps ul capure Modular-Q, =0. Modular-Q, =0.8 MPS-AE, =0. MPS-RE, = Fg. 5 he hsogram of hree learg algorhms From hese wo graph we ca fd ha, boh MPS-AE ad MPS-RE are beer ha Modular Q-learg. hese ca be explaed as followg. For he pursu problem dscussed here, he evrome cao obey Markova propery, whch meas he premse of Q-learg covergg o he opmal polcy cao be sasfed aymore. Moreover, because of he lmed sese ably, he huer wll wrogly rea wo or more dffere saes as oe, whch lead o he percepual alasg problem. Sce he learg age wll choose approprae aco, whch wll slow dow he learg speed. I addo, Q-learg s a off-polcy learg mehod. hs meas ha Q-learg s guaraeed o coverge o he opmal soluo uder MDP evrome regardless of wha polcy s followed durg rag, as log as each sae-aco pars s vsed fely of he lm. However, Modular-Q, he oe-sep value updaes for each learg module are compued uder he assumpo ha all fuure acos wll be chose opmally for ha Markova evrome. hs assumpo s vald uder he aco seleco mechasms descrbed above, whch uses he heursc polcy such as GM sraegy. Because he chose aco by medaor module wll represe some compromse polcy whch he dffere learg modules share corol. hs meas ha he compued Q-values do o coverge o he acual expeced reur uder he compose polcy. Modular prof sharg mehod wll suppress effecve sae-aco pars ha may rap o loop whou reward, ad reforce 429
7 effecve sae-aco par so wll gradually coverge o a raoal polcy, ad oba beer learg resuls. Addoally, we ca fd ha based o he parameers seg above, MPS-RE s a lle beer ha MPS-AE, whch seems someway lke he reforceme learg wh replacg elgbly races compares o reforceme learg wh accumulag elgbly races. o es performaces of proposed algorhms uder dffere parameer seg we le decay parameer wh dffere values. Fg 6 ad 7 are average learg resuls for MPS-AE ad MPS-RE uder dffere decay parameer respecvely. Whe decay parameer does hold he codos of heorem, he he performace of he algorhm decle he average capurg me s loger. hs s because whe decay parameer does o hold he codos of heorem, durg he learg proces some sae-aco pars ha wll lead o rap o loop whou ay reward also are reforced, so wll affec he performace of he learg algorhm. I addo, from he learg curves we ca fd ha, dffere decay parameer has dffere effec o he learg algorhm. As for he smulao hs paper, for MPS-AE, whe decay parameer 0., he learg algorhm performs bes. For MPS-RE, geerally, he algorhm performs beer wh he creme of decay parameers. Average seps ul capure MPS-AE, =0.05 MPS-AE, =0.0 MPS-AE, =0.5 MPS-AE, =0.20 MPS-AE, =0.50 Fg. 6 he learg resul of MPS-AE algorhm wh dffere decay parameers. Average seps ul capure MPS-RE,=0.0 MPS-RE,=0.20 MPS-RE,=0.30 MPS-RE,=0.50 MPS-RE,=0.80 Fg. 7 he learg resul of MPS-RE algorhm wh dffere decay parameers VI. CONCLUSION I order o solve he problem of huge saes mulage learg, as well as he percepual alasg problem because of he lmed sese ably of learg age we proposed a ew modular reforceme learg approach. Frsly, a kd of o-le prof sharg algorhm s proposed, ad he based o he advaages of modular archecure ad o-le prof sharg mehod, a modular o-le prof sharg approach s preseed. Expermeal resuls showed ha he proposed learg algorhm could be used o acheve opmal soluo beer, sead of radoal reforceme learg mehods. REFERENCES [] Paa L, Luke S, Cooperave Mul-Age Learg: he sae of he ar. Auoomous Ages ad Mul-Age Sysem 5, (3): [2] Ho F, Kamel M. Learg coordag sraeges for cooperave mulage sysems. Mache Learg, 998, 33(2-3): 55-77, [3] Garlad A, Alerma R. Auoomous ages ha lear o beer coordae. Auoomous Ages ad Mul-Age Sysem, 4, 8(3): [4] Kaelbg L P, Lma M L, Moore A W. Reforceme learg: A survey. Joural of Arfcal Research, 996, 4: [5] Suo R S, Baro A G. Reforceme learg: A roduco. Cambrdge, MA: MI Pres 998 [6] Excelee-oledo CB, Jegs NR. Usg reforceme learg o coordae beer. Compuaoal Iellgece, Vol. 2 No. 3, pp Blackwell Publshg 5 [7] CHEN G, YANG ZH. Coordag Mulple Ages va Reforceme Learg. Auoomous Ages ad Mul-Age Sysem 5, 0(3): [8] Oo N, Fukumoo K. Mul-age reforceme learg: A modular approach. I Proceedgs of he Secod Ieraoal Coferece o Mul-age Sysems. Porlad, Orego, USA. 996, pp: , AAAI Press [9] Myazak K, Yamamura M, Kobayash S. O he raoaly of prof sharg reforceme learg. I Proceedgs of he hrd Ieraoal Coferece o Fuzzy Logc, Neural Nes ad Sof Compug, pages Fuzzy Logc Sysems Isue, 994 [0] Ara S, Sycara K. Effecve learg approach for plag ad schedulg mul-age doma. I Proceedgs of he 6h Ieraoal Coferece o Smulao of Adapve Behavor. Par Frace. Sepember 0, pp: [] Ara S, Sycara K P, Paye R. Experece-based reforceme learg o acqure effecve behavor a mul-age doma. I Proceedgs of he 6h Pacfc Rm Ieraoal Coferece o Arfcal Iellgece. Melboure, Ausrala. 0, pp: [2] Bellma R. Dyamc programmg. Prceo, NJ: Prceo Uversy Pres 957 [3] Waks C J, Daya P. echcal Noe: Q-learg. Mache learg, 992, 8: [4] Whehead S, Karlsso J, eeberg J. Learg mulple goal behavor va ask decomposo ad dyamc polcy mergg. Robo Learg, Norwell, MA: Kluwer Academc Pres 993 [5] Grefesee J J. Cred assgme rule dscovery sysems based o geec algorhms. Mache Learg, 988, 3: [6] Myazak K, Kobayash S. O he raoaly of prof sharg parally observable markov decso process. I Proceedgs of he ffh Ieraoal Coferece o Iformao Sysems Aalyss ad Syhess. Orlado, FL, USA. 999, pp: [7] Whehead S D, Ballad D H. Acve percepo ad reforceme learg. I Proceedgs of 7h Ieraoal Coferece o Mache Learg. 990, pp: [8] Sgh S P, Suo R S. Reforceme learg wh replacg elgbly races. Mache Learg, 996, 22: [9] Beda M, Jagaaha V, Dodhawalla R. O opmal cooperao of kowledge source. echcal Repor No. BCS-G-28, Boeg Advaced echology Ceer, Boeg Compuer Servce Seale, WA,
The Poisson Process Properties of the Poisson Process
Posso Processes Summary The Posso Process Properes of he Posso Process Ierarrval mes Memoryless propery ad he resdual lfeme paradox Superposo of Posso processes Radom seleco of Posso Pos Bulk Arrvals ad
More informationReal-Time Systems. Example: scheduling using EDF. Feasibility analysis for EDF. Example: scheduling using EDF
EDA/DIT6 Real-Tme Sysems, Chalmers/GU, 0/0 ecure # Updaed February, 0 Real-Tme Sysems Specfcao Problem: Assume a sysem wh asks accordg o he fgure below The mg properes of he asks are gve he able Ivesgae
More information14. Poisson Processes
4. Posso Processes I Lecure 4 we roduced Posso arrvals as he lmg behavor of Bomal radom varables. Refer o Posso approxmao of Bomal radom varables. From he dscusso here see 4-6-4-8 Lecure 4 " arrvals occur
More information8. Queueing systems lect08.ppt S Introduction to Teletraffic Theory - Fall
8. Queueg sysems lec8. S-38.45 - Iroduco o Teleraffc Theory - Fall 8. Queueg sysems Coes Refresher: Smle eleraffc model M/M/ server wag laces M/M/ servers wag laces 8. Queueg sysems Smle eleraffc model
More informationFORCED VIBRATION of MDOF SYSTEMS
FORCED VIBRAION of DOF SSES he respose of a N DOF sysem s govered by he marx equao of moo: ] u C] u K] u 1 h al codos u u0 ad u u 0. hs marx equao of moo represes a sysem of N smulaeous equaos u ad s me
More informationKey words: Fractional difference equation, oscillatory solutions,
OSCILLATION PROPERTIES OF SOLUTIONS OF FRACTIONAL DIFFERENCE EQUATIONS Musafa BAYRAM * ad Ayd SECER * Deparme of Compuer Egeerg, Isabul Gelsm Uversy Deparme of Mahemacal Egeerg, Yldz Techcal Uversy * Correspodg
More informationLeast Squares Fitting (LSQF) with a complicated function Theexampleswehavelookedatsofarhavebeenlinearintheparameters
Leas Squares Fg LSQF wh a complcaed fuco Theeampleswehavelookedasofarhavebeelearheparameers ha we have bee rg o deerme e.g. slope, ercep. For he case where he fuco s lear he parameers we ca fd a aalc soluo
More informationFor the plane motion of a rigid body, an additional equation is needed to specify the state of rotation of the body.
The kecs of rgd bodes reas he relaoshps bewee he exeral forces acg o a body ad he correspodg raslaoal ad roaoal moos of he body. he kecs of he parcle, we foud ha wo force equaos of moo were requred o defe
More informationContinuous Time Markov Chains
Couous me Markov chas have seay sae probably soluos f a oly f hey are ergoc, us lke scree me Markov chas. Fg he seay sae probably vecor for a couous me Markov cha s o more ffcul ha s he scree me case,
More informationDetermination of Antoine Equation Parameters. December 4, 2012 PreFEED Corporation Yoshio Kumagae. Introduction
refeed Soluos for R&D o Desg Deermao of oe Equao arameers Soluos for R&D o Desg December 4, 0 refeed orporao Yosho Kumagae refeed Iroduco hyscal propery daa s exremely mpora for performg process desg ad
More informationCyclone. Anti-cyclone
Adveco Cycloe A-cycloe Lorez (963) Low dmesoal aracors. Uclear f hey are a good aalogy o he rue clmae sysem, bu hey have some appealg characerscs. Dscusso Is he al codo balaced? Is here a al adjusme
More informationThe algebraic immunity of a class of correlation immune H Boolean functions
Ieraoal Coferece o Advaced Elecroc Scece ad Techology (AEST 06) The algebrac mmuy of a class of correlao mmue H Boolea fucos a Jgla Huag ad Zhuo Wag School of Elecrcal Egeerg Norhwes Uversy for Naoales
More informationQR factorization. Let P 1, P 2, P n-1, be matrices such that Pn 1Pn 2... PPA
QR facorzao Ay x real marx ca be wre as AQR, where Q s orhogoal ad R s upper ragular. To oba Q ad R, we use he Householder rasformao as follows: Le P, P, P -, be marces such ha P P... PPA ( R s upper ragular.
More informationThe ray paths and travel times for multiple layers can be computed using ray-tracing, as demonstrated in Lab 3.
C. Trael me cures for mulple reflecors The ray pahs ad rael mes for mulple layers ca be compued usg ray-racg, as demosraed Lab. MATLAB scrp reflec_layers_.m performs smple ray racg. (m) ref(ms) ref(ms)
More informationMidterm Exam. Tuesday, September hour, 15 minutes
Ecoomcs of Growh, ECON560 Sa Fracsco Sae Uvers Mchael Bar Fall 203 Mderm Exam Tuesda, Sepember 24 hour, 5 mues Name: Isrucos. Ths s closed boo, closed oes exam. 2. No calculaors of a d are allowed. 3.
More information-distributed random variables consisting of n samples each. Determine the asymptotic confidence intervals for
Assgme Sepha Brumme Ocober 8h, 003 9 h semeser, 70544 PREFACE I 004, I ed o sped wo semesers o a sudy abroad as a posgraduae exchage sude a he Uversy of Techology Sydey, Ausrala. Each opporuy o ehace my
More informationOptimal Eye Movement Strategies in Visual Search (Supplement)
Opmal Eye Moveme Sraeges Vsual Search (Suppleme) Jr Naemk ad Wlso S. Gesler Ceer for Percepual Sysems ad Deparme of Psychology, Uversy of exas a Aus, Aus X 787 Here we derve he deal searcher for he case
More informationSolution set Stat 471/Spring 06. Homework 2
oluo se a 47/prg 06 Homework a Whe he upper ragular elemes are suppressed due o smmer b Le Y Y Y Y A weep o he frs colum o oba: A ˆ b chagg he oao eg ad ec YY weep o he secod colum o oba: Aˆ YY weep o
More information(1) Cov(, ) E[( E( ))( E( ))]
Impac of Auocorrelao o OLS Esmaes ECON 3033/Evas Cosder a smple bvarae me-seres model of he form: y 0 x The four key assumpos abou ε hs model are ) E(ε ) = E[ε x ]=0 ) Var(ε ) =Var(ε x ) = ) Cov(ε, ε )
More informationIMPROVED PORTFOLIO OPTIMIZATION MODEL WITH TRANSACTION COST AND MINIMAL TRANSACTION LOTS
Vol.7 No.4 (200) p73-78 Joural of Maageme Scece & Sascal Decso IMPROVED PORTFOLIO OPTIMIZATION MODEL WITH TRANSACTION COST AND MINIMAL TRANSACTION LOTS TIANXIANG YAO AND ZAIWU GONG College of Ecoomcs &
More informationFundamentals of Speech Recognition Suggested Project The Hidden Markov Model
. Projec Iroduco Fudameals of Speech Recogo Suggesed Projec The Hdde Markov Model For hs projec, s proposed ha you desg ad mpleme a hdde Markov model (HMM) ha opmally maches he behavor of a se of rag sequeces
More informationChapter 8. Simple Linear Regression
Chaper 8. Smple Lear Regresso Regresso aalyss: regresso aalyss s a sascal mehodology o esmae he relaoshp of a respose varable o a se of predcor varable. whe here s jus oe predcor varable, we wll use smple
More informationA Novel ACO with Average Entropy
J. Sofware Egeerg & Applcaos, 2009, 2: 370-374 do:10.4236/jsea.2009.25049 Publshed Ole December 2009 (hp://www.scrp.org/joural/jsea) A Novel ACO wh Average Eropy Yacag LI College of Cvl Egeerg, Hebe Uversy
More informationThe Mean Residual Lifetime of (n k + 1)-out-of-n Systems in Discrete Setting
Appled Mahemacs 4 5 466-477 Publshed Ole February 4 (hp//wwwscrporg/oural/am hp//dxdoorg/436/am45346 The Mea Resdual Lfeme of ( + -ou-of- Sysems Dscree Seg Maryam Torab Sahboom Deparme of Sascs Scece ad
More informationVARIATIONAL ITERATION METHOD FOR DELAY DIFFERENTIAL-ALGEBRAIC EQUATIONS. Hunan , China,
Mahemacal ad Compuaoal Applcaos Vol. 5 No. 5 pp. 834-839. Assocao for Scefc Research VARIATIONAL ITERATION METHOD FOR DELAY DIFFERENTIAL-ALGEBRAIC EQUATIONS Hoglag Lu Aguo Xao Yogxag Zhao School of Mahemacs
More informationMixed Integral Equation of Contact Problem in Position and Time
Ieraoal Joural of Basc & Appled Sceces IJBAS-IJENS Vol: No: 3 ed Iegral Equao of Coac Problem Poso ad me. A. Abdou S. J. oaquel Deparme of ahemacs Faculy of Educao Aleadra Uversy Egyp Deparme of ahemacs
More informationResearch on portfolio model based on information entropy theory
Avalable ole www.jocpr.com Joural of Chemcal ad Pharmaceucal esearch, 204, 6(6):286-290 esearch Arcle ISSN : 0975-7384 CODEN(USA) : JCPC5 esearch o porfolo model based o formao eropy heory Zhag Jusha,
More informationPartial Molar Properties of solutions
Paral Molar Properes of soluos A soluo s a homogeeous mxure; ha s, a soluo s a oephase sysem wh more ha oe compoe. A homogeeous mxures of wo or more compoes he gas, lqud or sold phase The properes of a
More informationFALL HOMEWORK NO. 6 - SOLUTION Problem 1.: Use the Storage-Indication Method to route the Input hydrograph tabulated below.
Jorge A. Ramírez HOMEWORK NO. 6 - SOLUTION Problem 1.: Use he Sorage-Idcao Mehod o roue he Ipu hydrograph abulaed below. Tme (h) Ipu Hydrograph (m 3 /s) Tme (h) Ipu Hydrograph (m 3 /s) 0 0 90 450 6 50
More informationThe Bernstein Operational Matrix of Integration
Appled Mahemacal Sceces, Vol. 3, 29, o. 49, 2427-2436 he Berse Operaoal Marx of Iegrao Am K. Sgh, Vee K. Sgh, Om P. Sgh Deparme of Appled Mahemacs Isue of echology, Baaras Hdu Uversy Varaas -225, Ida Asrac
More informationSolving fuzzy linear programming problems with piecewise linear membership functions by the determination of a crisp maximizing decision
Frs Jo Cogress o Fuzzy ad Iellge Sysems Ferdows Uversy of Mashhad Ira 9-3 Aug 7 Iellge Sysems Scefc Socey of Ira Solvg fuzzy lear programmg problems wh pecewse lear membershp fucos by he deermao of a crsp
More informationReal-time Classification of Large Data Sets using Binary Knapsack
Real-me Classfcao of Large Daa Ses usg Bary Kapsack Reao Bru bru@ds.uroma. Uversy of Roma La Sapeza AIRO 004-35h ANNUAL CONFERENCE OF THE ITALIAN OPERATIONS RESEARCH Sepember 7-0, 004, Lecce, Ialy Oule
More informationInternational Journal Of Engineering And Computer Science ISSN: Volume 5 Issue 12 Dec. 2016, Page No.
www.jecs. Ieraoal Joural Of Egeerg Ad Compuer Scece ISSN: 19-74 Volume 5 Issue 1 Dec. 16, Page No. 196-1974 Sofware Relably Model whe mulple errors occur a a me cludg a faul correco process K. Harshchadra
More informationGeneral Complex Fuzzy Transformation Semigroups in Automata
Joural of Advaces Compuer Research Quarerly pissn: 345-606x eissn: 345-6078 Sar Brach Islamc Azad Uversy Sar IRIra Vol 7 No May 06 Pages: 7-37 wwwacrausaracr Geeral Complex uzzy Trasformao Semgroups Auomaa
More informationQuantitative Portfolio Theory & Performance Analysis
550.447 Quaave Porfolo heory & Performace Aalyss Week February 4 203 Coceps. Assgme For February 4 (hs Week) ead: A&L Chaper Iroduco & Chaper (PF Maageme Evrome) Chaper 2 ( Coceps) Seco (Basc eur Calculaos)
More informationLecture 3 Topic 2: Distributions, hypothesis testing, and sample size determination
Lecure 3 Topc : Drbuo, hypohe eg, ad ample ze deermao The Sude - drbuo Coder a repeaed drawg of ample of ze from a ormal drbuo of mea. For each ample, compue,,, ad aoher ac,, where: The ac he devao of
More informationLeast squares and motion. Nuno Vasconcelos ECE Department, UCSD
Leas squares ad moo uo Vascocelos ECE Deparme UCSD Pla for oda oda we wll dscuss moo esmao hs s eresg wo was moo s ver useful as a cue for recogo segmeao compresso ec. s a grea eample of leas squares problem
More informationNature and Science, 5(1), 2007, Han and Xu, Multi-variable Grey Model based on Genetic Algorithm and its Application in Urban Water Consumption
Naure ad Scece, 5, 7, Ha ad u, ul-varable Grey odel based o Geec Algorhm ad s Applcao Urba Waer Cosumpo ul-varable Grey odel based o Geec Algorhm ad s Applcao Urba Waer Cosumpo Ha Ya*, u Shguo School of
More informationCyclically Interval Total Colorings of Cycles and Middle Graphs of Cycles
Ope Joural of Dsree Mahemas 2017 7 200-217 hp://wwwsrporg/joural/ojdm ISSN Ole: 2161-7643 ISSN Pr: 2161-7635 Cylally Ierval Toal Colorgs of Cyles Mddle Graphs of Cyles Yogqag Zhao 1 Shju Su 2 1 Shool of
More informationThe Linear Regression Of Weighted Segments
The Lear Regresso Of Weghed Segmes George Dael Maeescu Absrac. We proposed a regresso model where he depede varable s made o up of pos bu segmes. Ths suao correspods o he markes hroughou he da are observed
More informationQuantum Mechanics II Lecture 11 Time-dependent perturbation theory. Time-dependent perturbation theory (degenerate or non-degenerate starting state)
Pro. O. B. Wrgh, Auum Quaum Mechacs II Lecure Tme-depede perurbao heory Tme-depede perurbao heory (degeerae or o-degeerae sarg sae) Cosder a sgle parcle whch, s uperurbed codo wh Hamloa H, ca exs a superposo
More informationCOMPARISON OF ESTIMATORS OF PARAMETERS FOR THE RAYLEIGH DISTRIBUTION
COMPARISON OF ESTIMATORS OF PARAMETERS FOR THE RAYLEIGH DISTRIBUTION Eldesoky E. Affy. Faculy of Eg. Shbee El kom Meoufa Uv. Key word : Raylegh dsrbuo, leas squares mehod, relave leas squares, leas absolue
More informationAML710 CAD LECTURE 12 CUBIC SPLINE CURVES. Cubic Splines Matrix formulation Normalised cubic splines Alternate end conditions Parabolic blending
CUIC SLINE CURVES Cubc Sples Marx formulao Normalsed cubc sples Alerae ed codos arabolc bledg AML7 CAD LECTURE CUIC SLINE The ame sple comes from he physcal srume sple drafsme use o produce curves A geeral
More informationComplementary Tree Paired Domination in Graphs
IOSR Joural of Mahemacs (IOSR-JM) e-issn: 2278-5728, p-issn: 239-765X Volume 2, Issue 6 Ver II (Nov - Dec206), PP 26-3 wwwosrjouralsorg Complemeary Tree Pared Domao Graphs A Meeaksh, J Baskar Babujee 2
More informationStabilization of LTI Switched Systems with Input Time Delay. Engineering Letters, 14:2, EL_14_2_14 (Advance online publication: 16 May 2007) Lin Lin
Egeerg Leers, 4:2, EL_4_2_4 (Advace ole publcao: 6 May 27) Sablzao of LTI Swched Sysems wh Ipu Tme Delay L L Absrac Ths paper deals wh sablzao of LTI swched sysems wh pu me delay. A descrpo of sysems sablzao
More informationCompetitive Facility Location Problem with Demands Depending on the Facilities
Aa Pacc Maageme Revew 4) 009) 5-5 Compeve Facl Locao Problem wh Demad Depedg o he Facle Shogo Shode a* Kuag-Yh Yeh b Hao-Chg Ha c a Facul of Bue Admrao Kobe Gau Uver Japa bc Urba Plag Deparme Naoal Cheg
More informationAverage Consensus in Networks of Multi-Agent with Multiple Time-Varying Delays
I. J. Commucaos ewor ad Sysem Sceces 3 96-3 do:.436/jcs..38 Publshed Ole February (hp://www.scrp.org/joural/jcs/). Average Cosesus ewors of Mul-Age wh Mulple me-varyg Delays echeg ZHAG Hu YU Isue of olear
More informationNeural Network Global Sliding Mode PID Control for Robot Manipulators
Neural Newor Global Sldg Mode PID Corol for Robo Mapulaors. C. Kuo, Member, IAENG ad Y. J. Huag, Member, IAENG Absrac hs paper preses a eural ewor global PID-sldg mode corol mehod for he racg corol of
More informationComparison of the Bayesian and Maximum Likelihood Estimation for Weibull Distribution
Joural of Mahemacs ad Sascs 6 (2): 1-14, 21 ISSN 1549-3644 21 Scece Publcaos Comarso of he Bayesa ad Maxmum Lkelhood Esmao for Webull Dsrbuo Al Omar Mohammed Ahmed, Hadeel Salm Al-Kuub ad Noor Akma Ibrahm
More informationAsymptotic Behavior of Solutions of Nonlinear Delay Differential Equations With Impulse
P a g e Vol Issue7Ver,oveber Global Joural of Scece Froer Research Asypoc Behavor of Soluos of olear Delay Dffereal Equaos Wh Ipulse Zhag xog GJSFR Classfcao - F FOR 3 Absrac Ths paper sudes he asypoc
More informationModified Integrated Multi-Point Approximation And GA Used In Truss Topology Optimization
Joural of Muldscplary Egeerg Scece ad echology (JMES) Vol. 4 Issue 6, Jue - 2017 Modfed Iegraed Mul-Po Appromao Ad GA sed I russ opology Opmzao Adurahma M. Hasse 1, Mohammed A. Ha 2 Mechacal ad Idusral
More informationSolution. The straightforward approach is surprisingly difficult because one has to be careful about the limits.
ose ad Varably Homewor # (8), aswers Q: Power spera of some smple oses A Posso ose A Posso ose () s a sequee of dela-fuo pulses, eah ourrg depedely, a some rae r (More formally, s a sum of pulses of wdh
More informationAN INCREMENTAL QUASI-NEWTON METHOD WITH A LOCAL SUPERLINEAR CONVERGENCE RATE. Aryan Mokhtari Mark Eisen Alejandro Ribeiro
AN INCREMENTAL QUASI-NEWTON METHOD WITH A LOCAL SUPERLINEAR CONVERGENCE RATE Arya Mokhar Mark Ese Alejadro Rbero Deparme of Elecrcal ad Sysems Egeerg, Uversy of Pesylvaa ABSTRACT We prese a cremeal Broyde-Flecher-Goldfarb-Shao
More informationFully Fuzzy Linear Systems Solving Using MOLP
World Appled Sceces Joural 12 (12): 2268-2273, 2011 ISSN 1818-4952 IDOSI Publcaos, 2011 Fully Fuzzy Lear Sysems Solvg Usg MOLP Tofgh Allahvraloo ad Nasser Mkaelvad Deparme of Mahemacs, Islamc Azad Uversy,
More informationContinuous Indexed Variable Systems
Ieraoal Joural o Compuaoal cece ad Mahemacs. IN 0974-389 Volume 3, Number 4 (20), pp. 40-409 Ieraoal Research Publcao House hp://www.rphouse.com Couous Idexed Varable ysems. Pouhassa ad F. Mohammad ghjeh
More informationSolution of Impulsive Differential Equations with Boundary Conditions in Terms of Integral Equations
Joural of aheacs ad copuer Scece (4 39-38 Soluo of Ipulsve Dffereal Equaos wh Boudary Codos Ters of Iegral Equaos Arcle hsory: Receved Ocober 3 Acceped February 4 Avalable ole July 4 ohse Rabba Depare
More informationThe Optimal Combination Forecasting Based on ARIMA,VAR and SSM
Advaces Compuer, Sgals ad Sysems (206) : 3-7 Clausus Scefc Press, Caada The Opmal Combao Forecasg Based o ARIMA,VAR ad SSM Bebe Che,a, Mgya Jag,b* School of Iformao Scece ad Egeerg, Shadog Uversy, Ja,
More informationMoments of Order Statistics from Nonidentically Distributed Three Parameters Beta typei and Erlang Truncated Exponential Variables
Joural of Mahemacs ad Sascs 6 (4): 442-448, 200 SSN 549-3644 200 Scece Publcaos Momes of Order Sascs from Nodecally Dsrbued Three Parameers Bea ype ad Erlag Trucaed Expoeal Varables A.A. Jamoom ad Z.A.
More informationRedundancy System Fault Sampling Under Imperfect Maintenance
A publcao of CHEMICAL EGIEERIG TRASACTIOS VOL. 33, 03 Gues Edors: Erco Zo, Pero Barald Copyrgh 03, AIDIC Servz S.r.l., ISB 978-88-95608-4-; ISS 974-979 The Iala Assocao of Chemcal Egeerg Ole a: www.adc./ce
More informationThe textbook expresses the stock price as the present discounted value of the dividend paid and the price of the stock next period.
ublc Affars 974 Meze D. Ch Fall Socal Sceces 748 Uversy of Wscos-Madso Sock rces, News ad he Effce Markes Hypohess (rev d //) The rese Value Model Approach o Asse rcg The exbook expresses he sock prce
More informationDevelopment of Hybrid-Coded EPSO for Optimal Allocation of FACTS Devices in Uncertain Smart Grids
Avalable ole a www.scecedrec.com Proceda Compuer Scece 6 (011) 49 434 Complex Adapve Sysems, Volume 1 Cha H. Dagl, Edor Chef Coferece Orgazed by ssour Uversy of Scece ad Techology 011- Chcago, IL Developme
More informationFourth Order Runge-Kutta Method Based On Geometric Mean for Hybrid Fuzzy Initial Value Problems
IOSR Joural of Mahemacs (IOSR-JM) e-issn: 2278-5728, p-issn: 29-765X. Volume, Issue 2 Ver. II (Mar. - Apr. 27), PP 4-5 www.osrjourals.org Fourh Order Ruge-Kua Mehod Based O Geomerc Mea for Hybrd Fuzzy
More informationComplete Identification of Isotropic Configurations of a Caster Wheeled Mobile Robot with Nonredundant/Redundant Actuation
486 Ieraoal Joural Sugbok of Corol Km Auomao ad Byugkwo ad Sysems Moo vol 4 o 4 pp 486-494 Augus 006 Complee Idefcao of Isoropc Cofguraos of a Caser Wheeled Moble Robo wh Noreduda/Reduda Acuao Sugbok Km
More informationBrownian Motion and Stochastic Calculus. Brownian Motion and Stochastic Calculus
Browa Moo Sochasc Calculus Xogzh Che Uversy of Hawa a Maoa earme of Mahemacs Seember, 8 Absrac Ths oe s abou oob decomoso he bascs of Suare egrable margales Coes oob-meyer ecomoso Suare Iegrable Margales
More informationLinear Regression Linear Regression with Shrinkage
Lear Regresso Lear Regresso h Shrkage Iroduco Regresso meas predcg a couous (usuall scalar oupu from a vecor of couous pus (feaures x. Example: Predcg vehcle fuel effcec (mpg from 8 arbues: Lear Regresso
More informationarxiv: v1 [stat.ml] 21 Mar 2017
Sochasc Prmal Dual Coordae Mehod wh No-Uform Samplg Based o Opmaly Volaos Asush Shbagak shbagak.a.mllab.@gmal.com Deparme of Scefc ad Egeerg Smulao, Nagoya Isue of Techology arxv:703.0706v [sa.ml] Mar
More informationThe textbook expresses the stock price as the present discounted value of the dividend paid and the price of the stock next period.
coomcs 435 Meze. Ch Fall 07 Socal Sceces 748 Uversy of Wscos-Madso Sock rces, News ad he ffce Markes Hypohess The rese Value Model Approach o Asse rcg The exbook expresses he sock prce as he prese dscoued
More informationSome Probability Inequalities for Quadratic Forms of Negatively Dependent Subgaussian Random Variables
Joural of Sceces Islamc epublc of Ira 6(: 63-67 (005 Uvers of ehra ISSN 06-04 hp://scecesuacr Some Probabl Iequales for Quadrac Forms of Negavel Depede Subgaussa adom Varables M Am A ozorga ad H Zare 3
More informationMultiphase Flow Simulation Based on Unstructured Grid
200 Tuoral School o Flud Dyamcs: Topcs Turbulece Uversy of Marylad, May 24-28, 200 Oule Bacgroud Mulphase Flow Smulao Based o Usrucured Grd Bubble Pacg Mehod mehod Based o he Usrucured Grd Remar B CHEN,
More informationθ = θ Π Π Parametric counting process models θ θ θ Log-likelihood: Consider counting processes: Score functions:
Paramerc coug process models Cosder coug processes: N,,..., ha cou he occurreces of a eve of eres for dvduals Iesy processes: Lelhood λ ( ;,,..., N { } λ < Log-lelhood: l( log L( Score fucos: U ( l( log
More informationAbstract. Keywords: Mutation probability, evolutionary computation, optimization, sensitivity, variability. 1. Introduction. 2. Proposed Algorithm
EgOp 2008 Ieraoal Coferece o Egeerg Opmzao Ro de Jaero, Brazl, 01-05 Jue 2008. Absrac Redefg Muao Probables for Evoluoary Opmzao Problems Raja Aggarwal Faculy of Egeerg ad Compuer Scece Cocorda Uversy,
More informationSYRIAN SEISMIC CODE :
SYRIAN SEISMIC CODE 2004 : Two sac mehods have bee ssued Syra buldg code 2004 o calculae he laeral sesmc forces he buldg. The Frs Sac Mehod: I s he same mehod he prevous code (995) wh few modfcaos. I s
More informationFault Tolerant Computing. Fault Tolerant Computing CS 530 Probabilistic methods: overview
Probably 1/19/ CS 53 Probablsc mehods: overvew Yashwa K. Malaya Colorado Sae Uversy 1 Probablsc Mehods: Overvew Cocree umbers presece of uceray Probably Dsjo eves Sascal depedece Radom varables ad dsrbuos
More informationSolving Constrained Flow-Shop Scheduling. Problems with Three Machines
It J Cotemp Math Sceces, Vol 5, 2010, o 19, 921-929 Solvg Costraed Flow-Shop Schedulg Problems wth Three Maches P Pada ad P Rajedra Departmet of Mathematcs, School of Advaced Sceces, VIT Uversty, Vellore-632
More informationOptimal Reactive Power Dispatching for Automatic Voltage Control of Hydropower Station Based on an Improved Genetic Algorithm
Opmal eacve Power Dspachg for Auomac Volage Corol of Hydropower Sao ased o a Improved Geec Algorhm GEHAO SHENG ICHEN JIANG IAOJN SHEN YI ENG Deparme. of Elecrcal Egeerg Shagha Jaoog versy uhu Dsrc, Shagha,
More informationarxiv: v2 [cs.lg] 19 Dec 2016
1 Sasfcg mul-armed bad problems Paul Reverdy, Vabhav Srvasava, ad Naom Ehrch Leoard arxv:1512.07638v2 [cs.lg] 19 Dec 2016 Absrac Sasfcg s a relaxao of maxmzg ad allows for less rsky decso makg he face
More informationRegression Approach to Parameter Estimation of an Exponential Software Reliability Model
Amerca Joural of Theorecal ad Appled Sascs 06; 5(3): 80-86 hp://www.scecepublshggroup.com/j/ajas do: 0.648/j.ajas.060503. ISSN: 36-8999 (Pr); ISSN: 36-9006 (Ole) Regresso Approach o Parameer Esmao of a
More informationA Constitutive Model for Multi-Line Simulation of Granular Material Behavior Using Multi-Plane Pattern
Joural of Compuer Scece 5 (): 8-80, 009 ISSN 549-009 Scece Publcaos A Cosuve Model for Mul-Le Smulao of Graular Maeral Behavor Usg Mul-Plae Paer S.A. Sadread, A. Saed Darya ad M. Zae KN Toos Uversy of
More information4. THE DENSITY MATRIX
4. THE DENSTY MATRX The desy marx or desy operaor s a alerae represeao of he sae of a quaum sysem for whch we have prevously used he wavefuco. Alhough descrbg a quaum sysem wh he desy marx s equvale o
More informationHuman Action Recognition in Smart Classroom
Huma Aco Recogo Smar Classroom Habg Re Guagyou Xu (Depe of Compuer Scece ad echology, sghua Uversy, Bejg 00084, P.R.Cha) E-mal: rehabg@263.e xgy-dcs@mal.sghua.edu.c Absrac hs paper preses a ew sysem for
More informationENGINEERING solutions to decision-making problems are
3788 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 62, NO. 8, AUGUST 2017 Sasfcg Mul-Armed Bad Problems Paul Reverdy, Member, IEEE, Vabhav Srvasava, ad Naom Ehrch Leoard, Fellow, IEEE Absrac Sasfcg s a
More informationAsymptotic Regional Boundary Observer in Distributed Parameter Systems via Sensors Structures
Sesors,, 37-5 sesors ISSN 44-8 by MDPI hp://www.mdp.e/sesors Asympoc Regoal Boudary Observer Dsrbued Parameer Sysems va Sesors Srucures Raheam Al-Saphory Sysems Theory Laboraory, Uversy of Perpga, 5, aveue
More informationEE 6885 Statistical Pattern Recognition
EE 6885 Sascal Paer Recogo Fall 005 Prof. Shh-Fu Chag hp://www.ee.columba.edu/~sfchag Lecure 5 (9//05 4- Readg Model Parameer Esmao ML Esmao, Chap. 3. Mure of Gaussa ad EM Referece Boo, HTF Chap. 8.5 Teboo,
More informationThe Properties of Probability of Normal Chain
I. J. Coep. Mah. Sceces Vol. 8 23 o. 9 433-439 HIKARI Ld www.-hkar.co The Properes of Proaly of Noral Cha L Che School of Maheacs ad Sascs Zheghou Noral Uversy Zheghou Cy Hea Provce 4544 Cha cluu6697@sa.co
More informationChapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1)
Aoucemes Reags o E-reserves Proec roosal ue oay Parameer Esmao Bomercs CSE 9-a Lecure 6 CSE9a Fall 6 CSE9a Fall 6 Paer Classfcao Chaer 3: Mamum-Lelhoo & Bayesa Parameer Esmao ar All maerals hese sles were
More informationD 2 : Decentralized Training over Decentralized Data
D : Deceralzed rag over Deceralzed Daa Hal ag Xagru La Mg Ya 3 Ce Zhag 4 J Lu 5 Absrac Whle rag a mache learg model usg mulple workers, each of whch collecs daa from s ow daa source, would be useful whe
More informationPeriodic Resource Reallocation in Two-Echelon Repairable Item Inventory Systems
-ASE-009-070 Perodc Resource Reallocao wo-echelo Reparable Iem Iveory Sysems Hoog Chu LAU, Je PAN, Huawe SON Absrac ve a exsg sock allocao a veory sysem, s ofe ecessary o perform reallocao over mulple
More informationNOTE ON SIMPLE AND LOGARITHMIC RETURN
Appled udes Agrbusess ad Commerce AAC Ceer-r ublshg House, Debrece DOI:.94/AAC/27/-2/6 CIENIFIC AE NOE ON IME AND OGAIHMIC EUN aa Mskolcz Uversy of Debrece, Isue of Accoug ad Face mskolczpaa@gmal.com Absrac:
More informationCS344: Introduction to Artificial Intelligence
C344: Iroduco o Arfcal Iellgece Puhpa Bhaacharyya CE Dep. IIT Bombay Lecure 3 3 32 33: Forward ad bacward; Baum elch 9 h ad 2 March ad 2 d Aprl 203 Lecure 27 28 29 were o EM; dae 2 h March o 8 h March
More informationFresnel Equations cont.
Lecure 12 Chaper 4 Fresel quaos co. Toal eral refleco ad evaesce waves Opcal properes of meals Laer: Famlar aspecs of he eraco of lgh ad maer Fresel quaos r 2 Usg Sell s law, we ca re-wre: r s s r a a
More informationNew Guaranteed H Performance State Estimation for Delayed Neural Networks
Ieraoal Joural of Iformao ad Elecrocs Egeerg Vol. o. 6 ovember ew Guaraeed H Performace ae Esmao for Delayed eural eworks Wo Il Lee ad PooGyeo Park Absrac I hs paper a ew guaraeed performace sae esmao
More informationAn Efficient Dual to Ratio and Product Estimator of Population Variance in Sample Surveys
"cece as True Here" Joural of Mahemacs ad ascal cece, Volume 06, 78-88 cece gpos Publshg A Effce Dual o Rao ad Produc Esmaor of Populao Varace ample urves ubhash Kumar Yadav Deparme of Mahemacs ad ascs
More informationSynchronization of Complex Network System with Time-Varying Delay Via Periodically Intermittent Control
Sychrozao of Complex ework Sysem wh me-varyg Delay Va Perodcally Ierme Corol JIAG Ya Deparme of Elecrcal ad Iformao Egeerg Hua Elecrcal College of echology Xaga 4, Cha Absrac he sychrozao corol problem
More informationEfficient Estimators for Population Variance using Auxiliary Information
Global Joural of Mahemacal cece: Theor ad Praccal. IN 97-3 Volume 3, Number (), pp. 39-37 Ieraoal Reearch Publcao Houe hp://www.rphoue.com Effce Emaor for Populao Varace ug Aular Iformao ubhah Kumar Yadav
More informationAs evident from the full-sample-model, we continue to assume that individual errors are identically and
Maxmum Lkelhood smao Greee Ch.4; App. R scrp modsa, modsb If we feel safe makg assumpos o he sascal dsrbuo of he error erm, Maxmum Lkelhood smao (ML) s a aracve alerave o Leas Squares for lear regresso
More informationRATIO ESTIMATORS USING CHARACTERISTICS OF POISSON DISTRIBUTION WITH APPLICATION TO EARTHQUAKE DATA
The 7 h Ieraoal as of Sascs ad Ecoomcs Prague Sepember 9-0 Absrac RATIO ESTIMATORS USING HARATERISTIS OF POISSON ISTRIBUTION WITH APPLIATION TO EARTHQUAKE ATA Gamze Özel Naural pulaos bolog geecs educao
More informationAvailable online Journal of Scientific and Engineering Research, 2014, 1(1): Research Article
Avalable ole wwwjsaercom Joural o Scec ad Egeerg Research, 0, ():0-9 Research Arcle ISSN: 39-630 CODEN(USA): JSERBR NEW INFORMATION INEUALITIES ON DIFFERENCE OF GENERALIZED DIVERGENCES AND ITS APPLICATION
More informationCONTROLLABILITY OF A CLASS OF SINGULAR SYSTEMS
44 Asa Joural o Corol Vol 8 No 4 pp 44-43 December 6 -re Paper- CONTROLLAILITY OF A CLASS OF SINGULAR SYSTEMS Guagmg Xe ad Log Wag ASTRACT I hs paper several dere coceps o corollably are vesgaed or a class
More informationFinal Exam Applied Econometrics
Fal Eam Appled Ecoomercs. 0 Sppose we have he followg regresso resl: Depede Varable: SAT Sample: 437 Iclded observaos: 437 Whe heeroskedasc-cosse sadard errors & covarace Varable Coeffce Sd. Error -Sasc
More informationStudy on one-dimensional consolidation of soil under cyclic loading and with varied compressibility *
Zhuag e al. / J Zhejag v SCI 5 6A(:4-47 4 Joural of Zhejag versy SCIENCE ISSN 9-395 hp://www.ju.edu.c/jus E-mal: jus@ju.edu.c Sudy o oe-dmesoal cosoldao of sol uder cyclc loadg ad wh vared compressbly
More information