A Modular On-line Profit Sharing Approach in Multiagent Domains

Size: px
Start display at page:

Download "A Modular On-line Profit Sharing Approach in Multiagent Domains"

Transcription

1 A Modular O-le Prof Sharg Approach Mulage Domas Pucheg Zhou, ad Bgrog Hog Absrac How o coordae he behavors of he ages hrough learg s a challegg problem wh mul-age domas. Because of s complexy, rece work has focused o how coordaed sraeges ca be leared. Here we are eresed usg reforceme learg echques o lear he coordaed acos of a group of age whou requrg explc commucao amog hem. However, radoal reforceme learg mehods are based o he assumpo ha he evrome ca be modeled as Markov Decso Proces whch usually cao be sasfed whe mulple ages coexs he same evrome. Moreover, o effecvely coordae each age s behavor so as o acheve he goal, s ecessary o augme he sae of each age wh he formao abou oher exsg ages. Wherea as he umber of ages a mulage evrome crease he sae space of each age grows expoeally, whch wll cause he combaoal exploso problem. Prof sharg s oe of he reforceme learg mehods ha allow ages o lear effecve behavors from her expereces eve wh o-markova evromes. I hs paper, o remedy he drawback of he orgal prof sharg approach ha eeds much memory o sore each sae-aco par durg he learg proces we frsly address a kd of o-le raoal prof sharg algorhm. he, we egrae he advaages of modular learg archecure wh o-le raoal prof sharg algorhm, ad propose a ew modular reforceme learg model. he effecveess of he echque s demosraed usg he pursu problem. Keywords Mul-age learg; reforceme learg; raoal prof sharg; modular archecure. I. INRODUCION mulage sysem (MAS) whch here s a umber of Aauoomous ages eracg, wh each affecg he acos of he ohers esseally cosues a complex sysem. Performg ad compleg asks such a evrome ca be exremely dffcul. I hs paper, he cooperave MAS s cocered, whch several ages aemp, hrough her eraco, o joly solve asks or o maxmze her uly[]. Learg eables MAS o be more flexble ad robu ad makes hem beer able o hadle ucera ad chagg crcumsaces. hus how o coordae dffere ages Mauscrp receved Jue 26, 6. hs work was suppored par by he Naoal Scece Foudao uder Gra Pucheg Zhou s wh he School of Compuer Scece ad echology, Harb Isue of echology, Harb 5000, HeLogJag Provce, P.R.Cha(correspodg auhor o provde phoe: ; fax: ; e-mal: zhoupc@h.edu.c). Bgrog Hog s wh he School of Compuer Scece ad echology, Harb Isue of echology, Harb 5000, HeLogJag Provce, P.R.Cha (e-mal: hogbr@h.edu.c). behavors by learg so as o acheve he commo goal s a mpora heme mulage domas. (e.g., see [2, 3]) Reforceme learg (RL)[4, 5] s he problem faced by a age ha mus lear behavor hrough ral-ad-error eracos wh dyamc evrome ad has bee appled successfully may sgle age sysems. Learg from he evrome s robus because ages are drecly affeced by he dyamcs of he evrome. Because of hese characer RL has become oe of he mpora learg approaches for mulage learg. (e.g., see [6, 7]) I a mulage doma, o acheve he sharg goal, learg age eeds o augme s sae space o clude more effecve formao. A he same me, he creme of sae space wll lead o slow dow he learg process. Based o he dea of modular archecure, Oo[8] proposed a modular Q-learg model, whch ca reduce he sae space effecvely. However, Q-learg s based o he evrome whch should obey he Markova propery, ha he evrome ca by modeled by a Markov Decso Proces bu for mulage doma hese cao be me wh. he raoal prof sharg, whch s proposed by Myazak[9], s a cred assgme mechasm ha bases o rals ad errors. Learg ages erac wh he evrome hrough ral ad error, ad reforce effecve sae-aco pars ad suppress effecve sae-aco pars by learg epsode. Prof sharg coras wh oher reforceme learg approaches whch are based o Dyamc Programmg, such as emporal Dfferece mehod ad Q-learg, ha prof sharg guaraees covergece o a effecve polcy eve domas ha do o obey he Markova propery, f a ask s epsodc ad a cred s assged a approprae maer[0, ]. Based o hese researche hs paper we egrae he advaages of modular archecure wh raoal prof sharg, ad propose a ew modular reforceme learg approach. II. MODULAR REINFORCEMEN LEARNING A. Markov Decso Processes ad Reforceme Learg Markov Decso Process (MDP) s geerally regarded as he mahemacal foudao for RL. A fully observable MDP s a quadruple (S, A,, R), where S s a fe se of sae A s a se of aco : SAS [0,] s he sae raso fuco ha descrbes he probably ps ( ' sa, ) of edg up sae s whe performg aco a sae ad 424

2 R: SA s reward fuco ha reurs a umerc value afer akg aco a sae s. A age s polcy s a mappg : S A. he goal of he age s o fd a opmal polcy ha maxmzes he expeced sum of dscoued rewards V(, s ) E( r, s) () 0 where r s he scalar reward a me sep, [0,] s dscou facor. Equao () ca be rewre as V(, s ) r(, s a) p( s' a) V( s', ) (2) s' S where a s he aco deermed by polcy. Dyamc Programmg (DP) heory[2] guaraees here exss a leas a deermsc opmal polcy *, ha for ay s S, sasfes * * V(, s ) max{ r p(',) s s a V( s', )} (3) aa s' S Q-learg[3]s oe kd of model-free RL algorhm s ha based o DP. I essece Q-learg s a emporal-dfferece learg mehod. Q-learg drecly compues he approxmao of a opmal aco-value fuco, called Q-value, by usg he followg updae rule Qs (, a) Qs (, a) (4) ( ) r maxq( s, a) Q( s, a ) here [0,] s learg rae. B. Modular Q-learg a he modular Q-learg archecure[8] cosss of a medaor module ad learg modules whch amou o he umber of ages volved he ask, show as Fg. Each age he learg module carres ou Q-learg he evrome. I each learg module, learg coceraes o a sgle age ad he learg of oher ages s o cosdered. o complee he global goal, eeds a medaor module o arbrae he resuls of he learg modules. he medaor module combes learg modules decso polces usg a smple heursc decso procedure ad deermes a fal decso by he correspodg age. Reward Evrome Aco Sae Age Learg Module Medaor Module Q Q 2 Q L Learg Module 2 x x 2 x L Learg Module L Fg. he modular archecure for a learg age he learg ages ca decde her aco by usg he Greaes Mass (GM) sraegy proposed by Whehead[4], L k arg max Q ( xk, a). (7) aa k where Q k deoes he aco-value fuco susaed by he learg module k, L s he umber of learg modules. Iuvely, hs s a seleco b majory amog acos proposed by dvdual learg-modules. III. RAIONAL PROFI SHARING Prof sharg s orgally proposed by Grefesee[5]. he orgal verso used prof sharg as a cred assgme mehod based o ral-ad-error experece whou ulzg ay form of value esmao. However, hs approach does o guaraee he raoaly of a acqured polcy. o guaraee covergece o a raoal polcy a o-markova doma, Myazak[9, 6] roduces he raoaly heorem, whch he cred assgme fuco should sasfy, here we call raoal prof sharg. I a mulage evrome, a me sep he learg age observes a sae s, whch s geerally he parally avalable sae of s evrome. A aco a s he seleced from he aco se. Afer he aco s execued, he age deermes wheher a reward r s geeraed. If here s o reward, he age sores he sae-aco par so-called rule (s, a ) s epsodc memory, ad repeas hs cycle ul a reward s geeraed. he process of movg from a al sae o he fal reward sae s called a epsode. Whe a reward s gve o he age, he rules sored s epsodc memory are reforced a oce. he fuco ha maps saes o acos s called a polcy. A polcy s called raoal f ad oly f he expeced reward per a aco s larger ha zero. We call a subsequece of a epsode a deour whe dffere acos are seleced for he same sae a epsode. he rules o a deour are called effecve rules. o corol he effecve rule Myazak[6] gves he followg raoaly heorem of prof sharg. Lemma. Ay effecve rule ca be suppressed ff W, 2,..., W, C f j f (8) where C s a upper boud of he umber of coflcg effecve rule ad W s a upper boud of he legh of epsodes. I pracce, we ca le C=M-, where M s he umber of acos. f deoes a reforceme value for he rule seleced a sep before a reward s acqured. he equaly (8) s called suppresso codo. Lemma 2. Prof sharg ca lear a raoal polcy ff sasfes he suppresso codo of lemma. he algorhm for he prof sharg based o lookup able s followed. Algorhm. he prof sharg algorhm Ialze, for all ss, aa( s) : wsa (, ) a small cosa Repea (for each epsode) 0, alze s j 425

3 Repea (for each sep of epsode) Accordg o s, choose a usg weghed-roulee: a) Pr( a a s s) a' A( s) a' ) Save sae-aco par (s, a ) o epsodc memory Perform aco a ; observe reward r +, ad ex sae s + ul s + s ermal sae, for each par (s, a ) epsodc memory: ws (, a) ws (, a) f( r, ), 0,,..., ul he algorhm holds he soppg crera IV. MODULAR ON-LINE PROFI SHARING APPROACH Wh he modular-q model, each learg module oly cocers wh a par of formao of he evrome, hs may reduce he sze of sae space. However, wll easly lead o percepual alasg problem[7],.e., he learg age wrogly rea wo or more dffere saes as oe. A he same me, a mul-age doma, he succeeded sae of he learg age wll o oly be effeced by s aco ad he evrome, bu also be effeced by oher ages behavor oher word he evrome cao obey Markova propery. However, eve wh o-markova evrome, prof sharg mehod ca lear effecve polcy. So, here we adop prof sharg as a basc algorhm for each learg module. A. O-le Prof Sharg Algorhm Prof sharg algorhm show Algorhm s a off-le updag mehod, whch does o chage weghs of s sae-aco pars ul he ed of a ask. Oe drawback s ha wll requre ubouded memory space o sore all of he seleced sae-aco pars durg he ask. Based o he dea of elgbly races[5,8], here we propose a o-le prof sharg algorhm. he dea behd elgbly races s very smple. Each me a sae s vsed aes a shor-erm memory proces a race, whch he decays gradually over me. hs race marks he sae as elgble for learg. here are maly wo kds of elgbly race ha s accumulag ad replacg elgbly race as defed by equao (9) ad (0), respecvely. e (, s a) s aa e (, s a) (9) e (, s a) oherwse s a a e(, s a) 0 s a a (0) e (, s a) s s where s he decay parameer, ad 0. We apply elgbly races for mplemeg prof sharg mehod cremeally, ad propose a kd of o-le prof sharg algorhm, as show Algorhm 2. Algorhm 2. he o-le prof sharg algorhm Ialze, for all ss, aa( s) : wsa (, ) a small cosa Repea (for each epsode) for all a : esa (, ) 0 0, alze s Repea (for each sep of epsode) Accordg o s, choose a usg weghed-roulee: a) Pr( a a s s) a' A( s) a' ) Updae elgbly races: f use accumulag elgbly race he e( a) e ( a) f use replacg elgbly race he e( a) e( a) 0,for aa( s), a a ake aco a ; observe reward, r +, ad ex sae, s + For all a: wsa (, ) wsa (, ) r e(, sa) e(, s a) e(, s a) ul s + s he ermal sae ul he algorhm holds he soppg crera heorem. If he followg hold:. Algorhm adops cred assgme fuco f(, r) r where, ad. C 2. Algorhm 2 uses accumulag elgbly races. 3. he ermedae rewards are zero ul he ed of each epsode. he, Algorhm s equvale o Algorhm 2, boh of hem ca lear a raoal polcy. Proof. Fr we wll prove ha hese wo algorhms are equvale. Accordg o Kroecker Dela fuco:, j (, j) 0, j Equao (9) ca be rewre as e(, s a) e (, s a) (, s s)(, a a) (, ss0)(, aa0) (, ss)(, aa) (, ss )(, aa )) (, ss)(, aa) 0 (, ss)(, aa) I Algorhm 2, for ay sae-aco par ( a), afer oe erao he cremeal wegh s wsa (, ) r e(, sa), so a me sep whe a epsode ed he oal updae s w (, s a) r e (, s a) o 0 r s s a a 0 0 (, )(, ) 426

4 (, ) (, ) 2 (, ) (, ) 0 r s s a a r s s a a 0 r (, s s )(, a a ) Because all he ermedae rewards ul he ed of a epsode are zero, ha r r 2 r 0, so o(, ) (, )(, ) 0 w s a r s s a a () O he oher had, for Algorhm, a me sep whe a epsode s over, for ay sae-aco par, he sum of cremeal wegh ca be wre as Sce w (, s a) f(, r )(, s s )(, a a ) (2) f(, r ) off 0 r, equao (2) ca be rasformed o off (, ) (, )(, ) o(, ) 0 w s a r s s a a w s a herefore, for ay sae-aco par, he sums of all he updaes hese wo algorhms are same afer learg, so hey are equvale. Nex, we wll show ha hese wo algorhms ca lear a raoal polcy. For Algorhm, accordg o he cred assgme fuco, we wll have - C( ) C f(, r) C r r 0 0 C r Because C, C so - C C f(, r) r r f(, r). 0 herefore, sasfes he suppresso codo lemma, ad accordg o lemma 2, wll lear a raoal polcy. Because of he equvalece of Algorhm ad 2, we ca coclude ha Algorhm wll also lear a raoal polcy. heorem 2. If he followg hold: W. 2, where s he decay parameer equao (0), ad W s a upper boud of he legh of epsodes Lemma. 2. Algorhm 2 uses replacg elgbly races. 3. he ermedae rewards are zero ul he ed of each epsode. he, Algorhm 2 ca lear a raoal polcy. Proof. Fr defe a fuco 0, j f(, j) (, j), j he equao (0) ca be rewre as e ( a) f( s ) e ( a) ( s ) ( a, a ) f(, s s) f(, s s ) e(, s a) f(, s s )(, s s )(, a a ) (, s s )(, a a ) s s aa f ss ss aa 0 (, )(, ) (, ) (, )(, ) For ay sae-aco par (a), afer oe erao he cremeal wegh s wsa (, ) r e(, sa), so a me sep whe a epsode ed he oal updae s w (, s a) r e (, s a) o 0 Because all he ermedae rewards ul he ed of a epsode are zero, ha r r 2 r 0, so w (, s a) r e (, s a) o Assume a me sep - he vsed sae-aco par s (s c, a d ), he w ( s, a ) r e ( s, a ) r o c d c d A he same me, for ay oher uvsed sae-aco par a me -, whch s deoed by ( s, a ) U ( s, a ) s S, a A( s ) ( s, a ) we have u v u v u v u c d w ( s, a ) r e ( s, a ) o u v u v he for all uvsed sae-aco pars a me -, he sum of cremeal weghs s w ( s, a ) r e ( s, a ) o u v u v U U r ( s, s ) ( a, a ) f( s, s ) (3) 2 u v u 0 U r ( s, s ) ( a, a ) u v U Because a me - for ay (s u, a v ), holds ( s, s ) ( a, a ) 0 so u v r ( su, s ) ( av, a ) 0 (4) U Accordg o he defo, s appare ha so f( s, s ) u ( s, s ) ( a, a ) f( s, s ) ( s, s ) ( a, a ) u v u u v For each me sep a =0,,, -2, wh he se U here s a mos oe sae-aco par would be vsed, so ( s, s ) ( a, a ) Hece U u v ( s, s ) ( a, a ) f( s, s ) u v u U (5) Iegrae equao(4) ad equao (5) wh equao(3), we have 427

5 2 wo( su, av ) r r U 0 Because W W 2 herefore w ( s, a ) r w ( s, a ) U o u v o c d herefore, sasfes he suppresso codo lemma, accordg o lemma 2 Algorhm 2 ca lear a raoal polcy. B. he Proposed Modular Reforceme Learg Model Based o he aalyss meoed above, here we propose a modular prof sharg model, as show Fg 2. he learg model of he age cosss of hree module hey are Sae Decomposo Module (SDM), Learg Module (LM) ad Medaor Module (MM), respecvely. SDM deermes he sae of each LM, ad he each LM seds s local polcy o MM. MM wll deerme o choose a approprae aco for he learg age o ake. SDM Reward Sae Age LM ASM SAP PSM Evromes ad oher Ages LM 2 ASM SAP PSM MM Aco LM L ASM SAP PSM Fg. 2 he modular o-le prof sharg model of he age Each LM cosss of hree submodules. ASM: Aco Seleco Module, whch wll sed he local polcy o MM, cludg he avalable acos ad her weghs for he curre sae. SAP: Sae-Aco Par wegh able, whch keeps all of he sae-aco pars ad s wegh, also sores he elgbly race of each sae-aco par. PSM: Prof Sharg Module. Accordg o he receved reward r, he curre sae s ad he chose aco a of MM, PSM updaes he wegh of each sae-aco par usg o-le prof sharg algorhm. he proposed learg algorhm s show as Algorhm 3. Algorhm 3. he proposed modular o-le prof sharg algorhm. Ialze, se learg parameer for all ss, aa( s) each LM: wsa (, ) a small cosa 2. Repea (for each epsode): ) for all ss, a A( s) each LM, esa (, ) 0, 0 2) Repea (for each sep of epsode) () Observe curre sae s() (2) Deerme he sae of each LM, s ()(=,,L, L s he umber of LM) (3) Each LM seds s avalable acos aa(s ()) ad wegh s (), a) o MM (4) Based o receved polces from each LM, MM chooses aco a() usg GM sraegy: k a () arg max w ( s (), a) L aa( sk( )) k (5) Each LM updaes he elgbly race of he vsed sae-aco par. (6) ake aco a(); observe reward, r(+), ad ex sae, s(+) (7) For all of he sae-aco par each LM, updae s wegh ad elgbly race: wsa (, ) wsa (, ) r e(, sa) e(, s a) e(, s a) (8) If sae s(+) s a ermal sae, he go o sep 3); oherwse, +, go o sep (2) 3) If he algorhm holds he soppg crera, he he learg process s over; oherwse, go o sep ). A. Problem Doma V. SIMULAION o evaluae he learg approach proposed hs paper, we choose a modfed verso of he pursu problem[9] as he es bed. I a 8 8 orodal grd world, a sgle prey ad four huer ages are placed a radom posos he grd, as show Fg 3(a). A each me sep, ages sychroously execue oe ou of fve acos: sayg a he curre poso or movg orh, souh, we or eas. he prey ad he huer cao share a cell, bu more ha oe huer ca be a he same poso. Also, a age s o allowed o move off he evrome. Every age ca see objecs a a cera dsace. he dsace ad he cells covers are, respecvely, called he vsual deph ad he vsual evrome of he age. A age havg vsual deph d ca see all he cells sde he (2d+) (2d+) square aroud. hese cells are he vsual evrome of he age. Each age s assged a uque defer. A huer age ca locae he relave poso ad recogze he defer of ay oher ages s vsual evrome. he prey s capured, whe all of s eghbor cell are occuped by he huer as show Fg 3(b). he all of he ages are relocaed a ew radom posos. k 428

6 700 Average seps ul capure Modular-Q, =0. Modular-Q, =0.8 MPS-AE, =0. MPS-RE, =0.5 (a) Fg. 3 (a) Example of he pursu problem a grd world. (b) Examples of capurg saes he pursu problem has he followg characerscs. ) he evrome s fully dyamc, parally observable ad o-deermsc. 2) he huers ages are learg age whle he prey selecs s ow aco based o a radom sraegy. 3) he huer ages are self-eresed, ad here s o commucao bewee each oher. B. Expermeal Seg ad Smulao Resuls o es he proposed algorhm, we compare modular prof sharg based o accumulag elgbly race (MPS-AE), wh modular prof sharg based o replacg elgbly race (MPS-RE) ad Modular Q-learg (Modular-Q). Accordg o heorem, MPS-AE we se decay parameer 0., MPS-RE le 0.5. For Modular-Q, he learg rae 0. or 0.8, dscou facor 0.9. he oal learg epsodes are 5000, ad he maxmum eraos wh a epsode are 500 me seps. he vsual deph of he huer s d=3. Upo capurg he prey, all he huers surroudg he prey receve a reward of 00, ad accordgly all of her cosues learg modules uformly receve he same reward regardless of wha acos hey have jus proposed. I ay oher case huers receve a reward of zero. Sce he huer cao fsh capure he prey aloe, herefore, o capure he prey, he huer should ake he formao of he prey ad oher huers o accou, so as o beer coordae s behavor wh oher huers. For he problem of (N+) huers capurg oe prey, accordg o coveoal sae codg, he sae of Huer ca be deoed as s:=<p, H,, H N>, where P s he relave poso of he prey o Huer, ad H k (k=,, N) deoes he relave poso of Huer k o Huer. Whe he huer wh vsual deph d, he sze of he sae space s S =(v s +) N+. For module archecure, he sae space of each learg module for he huer s a combao of he relave posos of he prey ad oe oher huer, ha he sze of sae space s S =(v s +) 2, accordgly, he sze of oal sae space s S =N (v s +) 2, oher word he relaoshp bewee he sze of he sae space ad he umber of learg ages s rasformed from expoeal o power, whch meas grealy reduce he sze of sae space. Fg 4 s he learg curve of he hree learg algorhm uder dffere me afer several ral Fg 5 gves he hsogram of he learg resuls for hree algorhms. (b) Fg. 4 he graph of comparso of hree learg algorhms Average seps ul capure Modular-Q, =0. Modular-Q, =0.8 MPS-AE, =0. MPS-RE, = Fg. 5 he hsogram of hree learg algorhms From hese wo graph we ca fd ha, boh MPS-AE ad MPS-RE are beer ha Modular Q-learg. hese ca be explaed as followg. For he pursu problem dscussed here, he evrome cao obey Markova propery, whch meas he premse of Q-learg covergg o he opmal polcy cao be sasfed aymore. Moreover, because of he lmed sese ably, he huer wll wrogly rea wo or more dffere saes as oe, whch lead o he percepual alasg problem. Sce he learg age wll choose approprae aco, whch wll slow dow he learg speed. I addo, Q-learg s a off-polcy learg mehod. hs meas ha Q-learg s guaraeed o coverge o he opmal soluo uder MDP evrome regardless of wha polcy s followed durg rag, as log as each sae-aco pars s vsed fely of he lm. However, Modular-Q, he oe-sep value updaes for each learg module are compued uder he assumpo ha all fuure acos wll be chose opmally for ha Markova evrome. hs assumpo s vald uder he aco seleco mechasms descrbed above, whch uses he heursc polcy such as GM sraegy. Because he chose aco by medaor module wll represe some compromse polcy whch he dffere learg modules share corol. hs meas ha he compued Q-values do o coverge o he acual expeced reur uder he compose polcy. Modular prof sharg mehod wll suppress effecve sae-aco pars ha may rap o loop whou reward, ad reforce 429

7 effecve sae-aco par so wll gradually coverge o a raoal polcy, ad oba beer learg resuls. Addoally, we ca fd ha based o he parameers seg above, MPS-RE s a lle beer ha MPS-AE, whch seems someway lke he reforceme learg wh replacg elgbly races compares o reforceme learg wh accumulag elgbly races. o es performaces of proposed algorhms uder dffere parameer seg we le decay parameer wh dffere values. Fg 6 ad 7 are average learg resuls for MPS-AE ad MPS-RE uder dffere decay parameer respecvely. Whe decay parameer does hold he codos of heorem, he he performace of he algorhm decle he average capurg me s loger. hs s because whe decay parameer does o hold he codos of heorem, durg he learg proces some sae-aco pars ha wll lead o rap o loop whou ay reward also are reforced, so wll affec he performace of he learg algorhm. I addo, from he learg curves we ca fd ha, dffere decay parameer has dffere effec o he learg algorhm. As for he smulao hs paper, for MPS-AE, whe decay parameer 0., he learg algorhm performs bes. For MPS-RE, geerally, he algorhm performs beer wh he creme of decay parameers. Average seps ul capure MPS-AE, =0.05 MPS-AE, =0.0 MPS-AE, =0.5 MPS-AE, =0.20 MPS-AE, =0.50 Fg. 6 he learg resul of MPS-AE algorhm wh dffere decay parameers. Average seps ul capure MPS-RE,=0.0 MPS-RE,=0.20 MPS-RE,=0.30 MPS-RE,=0.50 MPS-RE,=0.80 Fg. 7 he learg resul of MPS-RE algorhm wh dffere decay parameers VI. CONCLUSION I order o solve he problem of huge saes mulage learg, as well as he percepual alasg problem because of he lmed sese ably of learg age we proposed a ew modular reforceme learg approach. Frsly, a kd of o-le prof sharg algorhm s proposed, ad he based o he advaages of modular archecure ad o-le prof sharg mehod, a modular o-le prof sharg approach s preseed. Expermeal resuls showed ha he proposed learg algorhm could be used o acheve opmal soluo beer, sead of radoal reforceme learg mehods. REFERENCES [] Paa L, Luke S, Cooperave Mul-Age Learg: he sae of he ar. Auoomous Ages ad Mul-Age Sysem 5, (3): [2] Ho F, Kamel M. Learg coordag sraeges for cooperave mulage sysems. Mache Learg, 998, 33(2-3): 55-77, [3] Garlad A, Alerma R. Auoomous ages ha lear o beer coordae. Auoomous Ages ad Mul-Age Sysem, 4, 8(3): [4] Kaelbg L P, Lma M L, Moore A W. Reforceme learg: A survey. Joural of Arfcal Research, 996, 4: [5] Suo R S, Baro A G. Reforceme learg: A roduco. Cambrdge, MA: MI Pres 998 [6] Excelee-oledo CB, Jegs NR. Usg reforceme learg o coordae beer. Compuaoal Iellgece, Vol. 2 No. 3, pp Blackwell Publshg 5 [7] CHEN G, YANG ZH. Coordag Mulple Ages va Reforceme Learg. Auoomous Ages ad Mul-Age Sysem 5, 0(3): [8] Oo N, Fukumoo K. Mul-age reforceme learg: A modular approach. I Proceedgs of he Secod Ieraoal Coferece o Mul-age Sysems. Porlad, Orego, USA. 996, pp: , AAAI Press [9] Myazak K, Yamamura M, Kobayash S. O he raoaly of prof sharg reforceme learg. I Proceedgs of he hrd Ieraoal Coferece o Fuzzy Logc, Neural Nes ad Sof Compug, pages Fuzzy Logc Sysems Isue, 994 [0] Ara S, Sycara K. Effecve learg approach for plag ad schedulg mul-age doma. I Proceedgs of he 6h Ieraoal Coferece o Smulao of Adapve Behavor. Par Frace. Sepember 0, pp: [] Ara S, Sycara K P, Paye R. Experece-based reforceme learg o acqure effecve behavor a mul-age doma. I Proceedgs of he 6h Pacfc Rm Ieraoal Coferece o Arfcal Iellgece. Melboure, Ausrala. 0, pp: [2] Bellma R. Dyamc programmg. Prceo, NJ: Prceo Uversy Pres 957 [3] Waks C J, Daya P. echcal Noe: Q-learg. Mache learg, 992, 8: [4] Whehead S, Karlsso J, eeberg J. Learg mulple goal behavor va ask decomposo ad dyamc polcy mergg. Robo Learg, Norwell, MA: Kluwer Academc Pres 993 [5] Grefesee J J. Cred assgme rule dscovery sysems based o geec algorhms. Mache Learg, 988, 3: [6] Myazak K, Kobayash S. O he raoaly of prof sharg parally observable markov decso process. I Proceedgs of he ffh Ieraoal Coferece o Iformao Sysems Aalyss ad Syhess. Orlado, FL, USA. 999, pp: [7] Whehead S D, Ballad D H. Acve percepo ad reforceme learg. I Proceedgs of 7h Ieraoal Coferece o Mache Learg. 990, pp: [8] Sgh S P, Suo R S. Reforceme learg wh replacg elgbly races. Mache Learg, 996, 22: [9] Beda M, Jagaaha V, Dodhawalla R. O opmal cooperao of kowledge source. echcal Repor No. BCS-G-28, Boeg Advaced echology Ceer, Boeg Compuer Servce Seale, WA,

The Poisson Process Properties of the Poisson Process

The Poisson Process Properties of the Poisson Process Posso Processes Summary The Posso Process Properes of he Posso Process Ierarrval mes Memoryless propery ad he resdual lfeme paradox Superposo of Posso processes Radom seleco of Posso Pos Bulk Arrvals ad

More information

Real-Time Systems. Example: scheduling using EDF. Feasibility analysis for EDF. Example: scheduling using EDF

Real-Time Systems. Example: scheduling using EDF. Feasibility analysis for EDF. Example: scheduling using EDF EDA/DIT6 Real-Tme Sysems, Chalmers/GU, 0/0 ecure # Updaed February, 0 Real-Tme Sysems Specfcao Problem: Assume a sysem wh asks accordg o he fgure below The mg properes of he asks are gve he able Ivesgae

More information

14. Poisson Processes

14. Poisson Processes 4. Posso Processes I Lecure 4 we roduced Posso arrvals as he lmg behavor of Bomal radom varables. Refer o Posso approxmao of Bomal radom varables. From he dscusso here see 4-6-4-8 Lecure 4 " arrvals occur

More information

8. Queueing systems lect08.ppt S Introduction to Teletraffic Theory - Fall

8. Queueing systems lect08.ppt S Introduction to Teletraffic Theory - Fall 8. Queueg sysems lec8. S-38.45 - Iroduco o Teleraffc Theory - Fall 8. Queueg sysems Coes Refresher: Smle eleraffc model M/M/ server wag laces M/M/ servers wag laces 8. Queueg sysems Smle eleraffc model

More information

FORCED VIBRATION of MDOF SYSTEMS

FORCED VIBRATION of MDOF SYSTEMS FORCED VIBRAION of DOF SSES he respose of a N DOF sysem s govered by he marx equao of moo: ] u C] u K] u 1 h al codos u u0 ad u u 0. hs marx equao of moo represes a sysem of N smulaeous equaos u ad s me

More information

Key words: Fractional difference equation, oscillatory solutions,

Key words: Fractional difference equation, oscillatory solutions, OSCILLATION PROPERTIES OF SOLUTIONS OF FRACTIONAL DIFFERENCE EQUATIONS Musafa BAYRAM * ad Ayd SECER * Deparme of Compuer Egeerg, Isabul Gelsm Uversy Deparme of Mahemacal Egeerg, Yldz Techcal Uversy * Correspodg

More information

Least Squares Fitting (LSQF) with a complicated function Theexampleswehavelookedatsofarhavebeenlinearintheparameters

Least Squares Fitting (LSQF) with a complicated function Theexampleswehavelookedatsofarhavebeenlinearintheparameters Leas Squares Fg LSQF wh a complcaed fuco Theeampleswehavelookedasofarhavebeelearheparameers ha we have bee rg o deerme e.g. slope, ercep. For he case where he fuco s lear he parameers we ca fd a aalc soluo

More information

For the plane motion of a rigid body, an additional equation is needed to specify the state of rotation of the body.

For the plane motion of a rigid body, an additional equation is needed to specify the state of rotation of the body. The kecs of rgd bodes reas he relaoshps bewee he exeral forces acg o a body ad he correspodg raslaoal ad roaoal moos of he body. he kecs of he parcle, we foud ha wo force equaos of moo were requred o defe

More information

Continuous Time Markov Chains

Continuous Time Markov Chains Couous me Markov chas have seay sae probably soluos f a oly f hey are ergoc, us lke scree me Markov chas. Fg he seay sae probably vecor for a couous me Markov cha s o more ffcul ha s he scree me case,

More information

Determination of Antoine Equation Parameters. December 4, 2012 PreFEED Corporation Yoshio Kumagae. Introduction

Determination of Antoine Equation Parameters. December 4, 2012 PreFEED Corporation Yoshio Kumagae. Introduction refeed Soluos for R&D o Desg Deermao of oe Equao arameers Soluos for R&D o Desg December 4, 0 refeed orporao Yosho Kumagae refeed Iroduco hyscal propery daa s exremely mpora for performg process desg ad

More information

Cyclone. Anti-cyclone

Cyclone. Anti-cyclone Adveco Cycloe A-cycloe Lorez (963) Low dmesoal aracors. Uclear f hey are a good aalogy o he rue clmae sysem, bu hey have some appealg characerscs. Dscusso Is he al codo balaced? Is here a al adjusme

More information

The algebraic immunity of a class of correlation immune H Boolean functions

The algebraic immunity of a class of correlation immune H Boolean functions Ieraoal Coferece o Advaced Elecroc Scece ad Techology (AEST 06) The algebrac mmuy of a class of correlao mmue H Boolea fucos a Jgla Huag ad Zhuo Wag School of Elecrcal Egeerg Norhwes Uversy for Naoales

More information

QR factorization. Let P 1, P 2, P n-1, be matrices such that Pn 1Pn 2... PPA

QR factorization. Let P 1, P 2, P n-1, be matrices such that Pn 1Pn 2... PPA QR facorzao Ay x real marx ca be wre as AQR, where Q s orhogoal ad R s upper ragular. To oba Q ad R, we use he Householder rasformao as follows: Le P, P, P -, be marces such ha P P... PPA ( R s upper ragular.

More information

The ray paths and travel times for multiple layers can be computed using ray-tracing, as demonstrated in Lab 3.

The ray paths and travel times for multiple layers can be computed using ray-tracing, as demonstrated in Lab 3. C. Trael me cures for mulple reflecors The ray pahs ad rael mes for mulple layers ca be compued usg ray-racg, as demosraed Lab. MATLAB scrp reflec_layers_.m performs smple ray racg. (m) ref(ms) ref(ms)

More information

Midterm Exam. Tuesday, September hour, 15 minutes

Midterm Exam. Tuesday, September hour, 15 minutes Ecoomcs of Growh, ECON560 Sa Fracsco Sae Uvers Mchael Bar Fall 203 Mderm Exam Tuesda, Sepember 24 hour, 5 mues Name: Isrucos. Ths s closed boo, closed oes exam. 2. No calculaors of a d are allowed. 3.

More information

-distributed random variables consisting of n samples each. Determine the asymptotic confidence intervals for

-distributed random variables consisting of n samples each. Determine the asymptotic confidence intervals for Assgme Sepha Brumme Ocober 8h, 003 9 h semeser, 70544 PREFACE I 004, I ed o sped wo semesers o a sudy abroad as a posgraduae exchage sude a he Uversy of Techology Sydey, Ausrala. Each opporuy o ehace my

More information

Optimal Eye Movement Strategies in Visual Search (Supplement)

Optimal Eye Movement Strategies in Visual Search (Supplement) Opmal Eye Moveme Sraeges Vsual Search (Suppleme) Jr Naemk ad Wlso S. Gesler Ceer for Percepual Sysems ad Deparme of Psychology, Uversy of exas a Aus, Aus X 787 Here we derve he deal searcher for he case

More information

Solution set Stat 471/Spring 06. Homework 2

Solution set Stat 471/Spring 06. Homework 2 oluo se a 47/prg 06 Homework a Whe he upper ragular elemes are suppressed due o smmer b Le Y Y Y Y A weep o he frs colum o oba: A ˆ b chagg he oao eg ad ec YY weep o he secod colum o oba: Aˆ YY weep o

More information

(1) Cov(, ) E[( E( ))( E( ))]

(1) Cov(, ) E[( E( ))( E( ))] Impac of Auocorrelao o OLS Esmaes ECON 3033/Evas Cosder a smple bvarae me-seres model of he form: y 0 x The four key assumpos abou ε hs model are ) E(ε ) = E[ε x ]=0 ) Var(ε ) =Var(ε x ) = ) Cov(ε, ε )

More information

IMPROVED PORTFOLIO OPTIMIZATION MODEL WITH TRANSACTION COST AND MINIMAL TRANSACTION LOTS

IMPROVED PORTFOLIO OPTIMIZATION MODEL WITH TRANSACTION COST AND MINIMAL TRANSACTION LOTS Vol.7 No.4 (200) p73-78 Joural of Maageme Scece & Sascal Decso IMPROVED PORTFOLIO OPTIMIZATION MODEL WITH TRANSACTION COST AND MINIMAL TRANSACTION LOTS TIANXIANG YAO AND ZAIWU GONG College of Ecoomcs &

More information

Fundamentals of Speech Recognition Suggested Project The Hidden Markov Model

Fundamentals of Speech Recognition Suggested Project The Hidden Markov Model . Projec Iroduco Fudameals of Speech Recogo Suggesed Projec The Hdde Markov Model For hs projec, s proposed ha you desg ad mpleme a hdde Markov model (HMM) ha opmally maches he behavor of a se of rag sequeces

More information

Chapter 8. Simple Linear Regression

Chapter 8. Simple Linear Regression Chaper 8. Smple Lear Regresso Regresso aalyss: regresso aalyss s a sascal mehodology o esmae he relaoshp of a respose varable o a se of predcor varable. whe here s jus oe predcor varable, we wll use smple

More information

A Novel ACO with Average Entropy

A Novel ACO with Average Entropy J. Sofware Egeerg & Applcaos, 2009, 2: 370-374 do:10.4236/jsea.2009.25049 Publshed Ole December 2009 (hp://www.scrp.org/joural/jsea) A Novel ACO wh Average Eropy Yacag LI College of Cvl Egeerg, Hebe Uversy

More information

The Mean Residual Lifetime of (n k + 1)-out-of-n Systems in Discrete Setting

The Mean Residual Lifetime of (n k + 1)-out-of-n Systems in Discrete Setting Appled Mahemacs 4 5 466-477 Publshed Ole February 4 (hp//wwwscrporg/oural/am hp//dxdoorg/436/am45346 The Mea Resdual Lfeme of ( + -ou-of- Sysems Dscree Seg Maryam Torab Sahboom Deparme of Sascs Scece ad

More information

VARIATIONAL ITERATION METHOD FOR DELAY DIFFERENTIAL-ALGEBRAIC EQUATIONS. Hunan , China,

VARIATIONAL ITERATION METHOD FOR DELAY DIFFERENTIAL-ALGEBRAIC EQUATIONS. Hunan , China, Mahemacal ad Compuaoal Applcaos Vol. 5 No. 5 pp. 834-839. Assocao for Scefc Research VARIATIONAL ITERATION METHOD FOR DELAY DIFFERENTIAL-ALGEBRAIC EQUATIONS Hoglag Lu Aguo Xao Yogxag Zhao School of Mahemacs

More information

Mixed Integral Equation of Contact Problem in Position and Time

Mixed Integral Equation of Contact Problem in Position and Time Ieraoal Joural of Basc & Appled Sceces IJBAS-IJENS Vol: No: 3 ed Iegral Equao of Coac Problem Poso ad me. A. Abdou S. J. oaquel Deparme of ahemacs Faculy of Educao Aleadra Uversy Egyp Deparme of ahemacs

More information

Research on portfolio model based on information entropy theory

Research on portfolio model based on information entropy theory Avalable ole www.jocpr.com Joural of Chemcal ad Pharmaceucal esearch, 204, 6(6):286-290 esearch Arcle ISSN : 0975-7384 CODEN(USA) : JCPC5 esearch o porfolo model based o formao eropy heory Zhag Jusha,

More information

Partial Molar Properties of solutions

Partial Molar Properties of solutions Paral Molar Properes of soluos A soluo s a homogeeous mxure; ha s, a soluo s a oephase sysem wh more ha oe compoe. A homogeeous mxures of wo or more compoes he gas, lqud or sold phase The properes of a

More information

FALL HOMEWORK NO. 6 - SOLUTION Problem 1.: Use the Storage-Indication Method to route the Input hydrograph tabulated below.

FALL HOMEWORK NO. 6 - SOLUTION Problem 1.: Use the Storage-Indication Method to route the Input hydrograph tabulated below. Jorge A. Ramírez HOMEWORK NO. 6 - SOLUTION Problem 1.: Use he Sorage-Idcao Mehod o roue he Ipu hydrograph abulaed below. Tme (h) Ipu Hydrograph (m 3 /s) Tme (h) Ipu Hydrograph (m 3 /s) 0 0 90 450 6 50

More information

The Bernstein Operational Matrix of Integration

The Bernstein Operational Matrix of Integration Appled Mahemacal Sceces, Vol. 3, 29, o. 49, 2427-2436 he Berse Operaoal Marx of Iegrao Am K. Sgh, Vee K. Sgh, Om P. Sgh Deparme of Appled Mahemacs Isue of echology, Baaras Hdu Uversy Varaas -225, Ida Asrac

More information

Solving fuzzy linear programming problems with piecewise linear membership functions by the determination of a crisp maximizing decision

Solving fuzzy linear programming problems with piecewise linear membership functions by the determination of a crisp maximizing decision Frs Jo Cogress o Fuzzy ad Iellge Sysems Ferdows Uversy of Mashhad Ira 9-3 Aug 7 Iellge Sysems Scefc Socey of Ira Solvg fuzzy lear programmg problems wh pecewse lear membershp fucos by he deermao of a crsp

More information

Real-time Classification of Large Data Sets using Binary Knapsack

Real-time Classification of Large Data Sets using Binary Knapsack Real-me Classfcao of Large Daa Ses usg Bary Kapsack Reao Bru bru@ds.uroma. Uversy of Roma La Sapeza AIRO 004-35h ANNUAL CONFERENCE OF THE ITALIAN OPERATIONS RESEARCH Sepember 7-0, 004, Lecce, Ialy Oule

More information

International Journal Of Engineering And Computer Science ISSN: Volume 5 Issue 12 Dec. 2016, Page No.

International Journal Of Engineering And Computer Science ISSN: Volume 5 Issue 12 Dec. 2016, Page No. www.jecs. Ieraoal Joural Of Egeerg Ad Compuer Scece ISSN: 19-74 Volume 5 Issue 1 Dec. 16, Page No. 196-1974 Sofware Relably Model whe mulple errors occur a a me cludg a faul correco process K. Harshchadra

More information

General Complex Fuzzy Transformation Semigroups in Automata

General Complex Fuzzy Transformation Semigroups in Automata Joural of Advaces Compuer Research Quarerly pissn: 345-606x eissn: 345-6078 Sar Brach Islamc Azad Uversy Sar IRIra Vol 7 No May 06 Pages: 7-37 wwwacrausaracr Geeral Complex uzzy Trasformao Semgroups Auomaa

More information

Quantitative Portfolio Theory & Performance Analysis

Quantitative Portfolio Theory & Performance Analysis 550.447 Quaave Porfolo heory & Performace Aalyss Week February 4 203 Coceps. Assgme For February 4 (hs Week) ead: A&L Chaper Iroduco & Chaper (PF Maageme Evrome) Chaper 2 ( Coceps) Seco (Basc eur Calculaos)

More information

Lecture 3 Topic 2: Distributions, hypothesis testing, and sample size determination

Lecture 3 Topic 2: Distributions, hypothesis testing, and sample size determination Lecure 3 Topc : Drbuo, hypohe eg, ad ample ze deermao The Sude - drbuo Coder a repeaed drawg of ample of ze from a ormal drbuo of mea. For each ample, compue,,, ad aoher ac,, where: The ac he devao of

More information

Least squares and motion. Nuno Vasconcelos ECE Department, UCSD

Least squares and motion. Nuno Vasconcelos ECE Department, UCSD Leas squares ad moo uo Vascocelos ECE Deparme UCSD Pla for oda oda we wll dscuss moo esmao hs s eresg wo was moo s ver useful as a cue for recogo segmeao compresso ec. s a grea eample of leas squares problem

More information

Nature and Science, 5(1), 2007, Han and Xu, Multi-variable Grey Model based on Genetic Algorithm and its Application in Urban Water Consumption

Nature and Science, 5(1), 2007, Han and Xu, Multi-variable Grey Model based on Genetic Algorithm and its Application in Urban Water Consumption Naure ad Scece, 5, 7, Ha ad u, ul-varable Grey odel based o Geec Algorhm ad s Applcao Urba Waer Cosumpo ul-varable Grey odel based o Geec Algorhm ad s Applcao Urba Waer Cosumpo Ha Ya*, u Shguo School of

More information

Cyclically Interval Total Colorings of Cycles and Middle Graphs of Cycles

Cyclically Interval Total Colorings of Cycles and Middle Graphs of Cycles Ope Joural of Dsree Mahemas 2017 7 200-217 hp://wwwsrporg/joural/ojdm ISSN Ole: 2161-7643 ISSN Pr: 2161-7635 Cylally Ierval Toal Colorgs of Cyles Mddle Graphs of Cyles Yogqag Zhao 1 Shju Su 2 1 Shool of

More information

The Linear Regression Of Weighted Segments

The Linear Regression Of Weighted Segments The Lear Regresso Of Weghed Segmes George Dael Maeescu Absrac. We proposed a regresso model where he depede varable s made o up of pos bu segmes. Ths suao correspods o he markes hroughou he da are observed

More information

Quantum Mechanics II Lecture 11 Time-dependent perturbation theory. Time-dependent perturbation theory (degenerate or non-degenerate starting state)

Quantum Mechanics II Lecture 11 Time-dependent perturbation theory. Time-dependent perturbation theory (degenerate or non-degenerate starting state) Pro. O. B. Wrgh, Auum Quaum Mechacs II Lecure Tme-depede perurbao heory Tme-depede perurbao heory (degeerae or o-degeerae sarg sae) Cosder a sgle parcle whch, s uperurbed codo wh Hamloa H, ca exs a superposo

More information

COMPARISON OF ESTIMATORS OF PARAMETERS FOR THE RAYLEIGH DISTRIBUTION

COMPARISON OF ESTIMATORS OF PARAMETERS FOR THE RAYLEIGH DISTRIBUTION COMPARISON OF ESTIMATORS OF PARAMETERS FOR THE RAYLEIGH DISTRIBUTION Eldesoky E. Affy. Faculy of Eg. Shbee El kom Meoufa Uv. Key word : Raylegh dsrbuo, leas squares mehod, relave leas squares, leas absolue

More information

AML710 CAD LECTURE 12 CUBIC SPLINE CURVES. Cubic Splines Matrix formulation Normalised cubic splines Alternate end conditions Parabolic blending

AML710 CAD LECTURE 12 CUBIC SPLINE CURVES. Cubic Splines Matrix formulation Normalised cubic splines Alternate end conditions Parabolic blending CUIC SLINE CURVES Cubc Sples Marx formulao Normalsed cubc sples Alerae ed codos arabolc bledg AML7 CAD LECTURE CUIC SLINE The ame sple comes from he physcal srume sple drafsme use o produce curves A geeral

More information

Complementary Tree Paired Domination in Graphs

Complementary Tree Paired Domination in Graphs IOSR Joural of Mahemacs (IOSR-JM) e-issn: 2278-5728, p-issn: 239-765X Volume 2, Issue 6 Ver II (Nov - Dec206), PP 26-3 wwwosrjouralsorg Complemeary Tree Pared Domao Graphs A Meeaksh, J Baskar Babujee 2

More information

Stabilization of LTI Switched Systems with Input Time Delay. Engineering Letters, 14:2, EL_14_2_14 (Advance online publication: 16 May 2007) Lin Lin

Stabilization of LTI Switched Systems with Input Time Delay. Engineering Letters, 14:2, EL_14_2_14 (Advance online publication: 16 May 2007) Lin Lin Egeerg Leers, 4:2, EL_4_2_4 (Advace ole publcao: 6 May 27) Sablzao of LTI Swched Sysems wh Ipu Tme Delay L L Absrac Ths paper deals wh sablzao of LTI swched sysems wh pu me delay. A descrpo of sysems sablzao

More information

Competitive Facility Location Problem with Demands Depending on the Facilities

Competitive Facility Location Problem with Demands Depending on the Facilities Aa Pacc Maageme Revew 4) 009) 5-5 Compeve Facl Locao Problem wh Demad Depedg o he Facle Shogo Shode a* Kuag-Yh Yeh b Hao-Chg Ha c a Facul of Bue Admrao Kobe Gau Uver Japa bc Urba Plag Deparme Naoal Cheg

More information

Average Consensus in Networks of Multi-Agent with Multiple Time-Varying Delays

Average Consensus in Networks of Multi-Agent with Multiple Time-Varying Delays I. J. Commucaos ewor ad Sysem Sceces 3 96-3 do:.436/jcs..38 Publshed Ole February (hp://www.scrp.org/joural/jcs/). Average Cosesus ewors of Mul-Age wh Mulple me-varyg Delays echeg ZHAG Hu YU Isue of olear

More information

Neural Network Global Sliding Mode PID Control for Robot Manipulators

Neural Network Global Sliding Mode PID Control for Robot Manipulators Neural Newor Global Sldg Mode PID Corol for Robo Mapulaors. C. Kuo, Member, IAENG ad Y. J. Huag, Member, IAENG Absrac hs paper preses a eural ewor global PID-sldg mode corol mehod for he racg corol of

More information

Comparison of the Bayesian and Maximum Likelihood Estimation for Weibull Distribution

Comparison of the Bayesian and Maximum Likelihood Estimation for Weibull Distribution Joural of Mahemacs ad Sascs 6 (2): 1-14, 21 ISSN 1549-3644 21 Scece Publcaos Comarso of he Bayesa ad Maxmum Lkelhood Esmao for Webull Dsrbuo Al Omar Mohammed Ahmed, Hadeel Salm Al-Kuub ad Noor Akma Ibrahm

More information

Asymptotic Behavior of Solutions of Nonlinear Delay Differential Equations With Impulse

Asymptotic Behavior of Solutions of Nonlinear Delay Differential Equations With Impulse P a g e Vol Issue7Ver,oveber Global Joural of Scece Froer Research Asypoc Behavor of Soluos of olear Delay Dffereal Equaos Wh Ipulse Zhag xog GJSFR Classfcao - F FOR 3 Absrac Ths paper sudes he asypoc

More information

Modified Integrated Multi-Point Approximation And GA Used In Truss Topology Optimization

Modified Integrated Multi-Point Approximation And GA Used In Truss Topology Optimization Joural of Muldscplary Egeerg Scece ad echology (JMES) Vol. 4 Issue 6, Jue - 2017 Modfed Iegraed Mul-Po Appromao Ad GA sed I russ opology Opmzao Adurahma M. Hasse 1, Mohammed A. Ha 2 Mechacal ad Idusral

More information

Solution. The straightforward approach is surprisingly difficult because one has to be careful about the limits.

Solution. The straightforward approach is surprisingly difficult because one has to be careful about the limits. ose ad Varably Homewor # (8), aswers Q: Power spera of some smple oses A Posso ose A Posso ose () s a sequee of dela-fuo pulses, eah ourrg depedely, a some rae r (More formally, s a sum of pulses of wdh

More information

AN INCREMENTAL QUASI-NEWTON METHOD WITH A LOCAL SUPERLINEAR CONVERGENCE RATE. Aryan Mokhtari Mark Eisen Alejandro Ribeiro

AN INCREMENTAL QUASI-NEWTON METHOD WITH A LOCAL SUPERLINEAR CONVERGENCE RATE. Aryan Mokhtari Mark Eisen Alejandro Ribeiro AN INCREMENTAL QUASI-NEWTON METHOD WITH A LOCAL SUPERLINEAR CONVERGENCE RATE Arya Mokhar Mark Ese Alejadro Rbero Deparme of Elecrcal ad Sysems Egeerg, Uversy of Pesylvaa ABSTRACT We prese a cremeal Broyde-Flecher-Goldfarb-Shao

More information

Fully Fuzzy Linear Systems Solving Using MOLP

Fully Fuzzy Linear Systems Solving Using MOLP World Appled Sceces Joural 12 (12): 2268-2273, 2011 ISSN 1818-4952 IDOSI Publcaos, 2011 Fully Fuzzy Lear Sysems Solvg Usg MOLP Tofgh Allahvraloo ad Nasser Mkaelvad Deparme of Mahemacs, Islamc Azad Uversy,

More information

Continuous Indexed Variable Systems

Continuous Indexed Variable Systems Ieraoal Joural o Compuaoal cece ad Mahemacs. IN 0974-389 Volume 3, Number 4 (20), pp. 40-409 Ieraoal Research Publcao House hp://www.rphouse.com Couous Idexed Varable ysems. Pouhassa ad F. Mohammad ghjeh

More information

Solution of Impulsive Differential Equations with Boundary Conditions in Terms of Integral Equations

Solution of Impulsive Differential Equations with Boundary Conditions in Terms of Integral Equations Joural of aheacs ad copuer Scece (4 39-38 Soluo of Ipulsve Dffereal Equaos wh Boudary Codos Ters of Iegral Equaos Arcle hsory: Receved Ocober 3 Acceped February 4 Avalable ole July 4 ohse Rabba Depare

More information

The Optimal Combination Forecasting Based on ARIMA,VAR and SSM

The Optimal Combination Forecasting Based on ARIMA,VAR and SSM Advaces Compuer, Sgals ad Sysems (206) : 3-7 Clausus Scefc Press, Caada The Opmal Combao Forecasg Based o ARIMA,VAR ad SSM Bebe Che,a, Mgya Jag,b* School of Iformao Scece ad Egeerg, Shadog Uversy, Ja,

More information

Moments of Order Statistics from Nonidentically Distributed Three Parameters Beta typei and Erlang Truncated Exponential Variables

Moments of Order Statistics from Nonidentically Distributed Three Parameters Beta typei and Erlang Truncated Exponential Variables Joural of Mahemacs ad Sascs 6 (4): 442-448, 200 SSN 549-3644 200 Scece Publcaos Momes of Order Sascs from Nodecally Dsrbued Three Parameers Bea ype ad Erlag Trucaed Expoeal Varables A.A. Jamoom ad Z.A.

More information

Redundancy System Fault Sampling Under Imperfect Maintenance

Redundancy System Fault Sampling Under Imperfect Maintenance A publcao of CHEMICAL EGIEERIG TRASACTIOS VOL. 33, 03 Gues Edors: Erco Zo, Pero Barald Copyrgh 03, AIDIC Servz S.r.l., ISB 978-88-95608-4-; ISS 974-979 The Iala Assocao of Chemcal Egeerg Ole a: www.adc./ce

More information

The textbook expresses the stock price as the present discounted value of the dividend paid and the price of the stock next period.

The textbook expresses the stock price as the present discounted value of the dividend paid and the price of the stock next period. ublc Affars 974 Meze D. Ch Fall Socal Sceces 748 Uversy of Wscos-Madso Sock rces, News ad he Effce Markes Hypohess (rev d //) The rese Value Model Approach o Asse rcg The exbook expresses he sock prce

More information

Development of Hybrid-Coded EPSO for Optimal Allocation of FACTS Devices in Uncertain Smart Grids

Development of Hybrid-Coded EPSO for Optimal Allocation of FACTS Devices in Uncertain Smart Grids Avalable ole a www.scecedrec.com Proceda Compuer Scece 6 (011) 49 434 Complex Adapve Sysems, Volume 1 Cha H. Dagl, Edor Chef Coferece Orgazed by ssour Uversy of Scece ad Techology 011- Chcago, IL Developme

More information

Fourth Order Runge-Kutta Method Based On Geometric Mean for Hybrid Fuzzy Initial Value Problems

Fourth Order Runge-Kutta Method Based On Geometric Mean for Hybrid Fuzzy Initial Value Problems IOSR Joural of Mahemacs (IOSR-JM) e-issn: 2278-5728, p-issn: 29-765X. Volume, Issue 2 Ver. II (Mar. - Apr. 27), PP 4-5 www.osrjourals.org Fourh Order Ruge-Kua Mehod Based O Geomerc Mea for Hybrd Fuzzy

More information

Complete Identification of Isotropic Configurations of a Caster Wheeled Mobile Robot with Nonredundant/Redundant Actuation

Complete Identification of Isotropic Configurations of a Caster Wheeled Mobile Robot with Nonredundant/Redundant Actuation 486 Ieraoal Joural Sugbok of Corol Km Auomao ad Byugkwo ad Sysems Moo vol 4 o 4 pp 486-494 Augus 006 Complee Idefcao of Isoropc Cofguraos of a Caser Wheeled Moble Robo wh Noreduda/Reduda Acuao Sugbok Km

More information

Brownian Motion and Stochastic Calculus. Brownian Motion and Stochastic Calculus

Brownian Motion and Stochastic Calculus. Brownian Motion and Stochastic Calculus Browa Moo Sochasc Calculus Xogzh Che Uversy of Hawa a Maoa earme of Mahemacs Seember, 8 Absrac Ths oe s abou oob decomoso he bascs of Suare egrable margales Coes oob-meyer ecomoso Suare Iegrable Margales

More information

Linear Regression Linear Regression with Shrinkage

Linear Regression Linear Regression with Shrinkage Lear Regresso Lear Regresso h Shrkage Iroduco Regresso meas predcg a couous (usuall scalar oupu from a vecor of couous pus (feaures x. Example: Predcg vehcle fuel effcec (mpg from 8 arbues: Lear Regresso

More information

arxiv: v1 [stat.ml] 21 Mar 2017

arxiv: v1 [stat.ml] 21 Mar 2017 Sochasc Prmal Dual Coordae Mehod wh No-Uform Samplg Based o Opmaly Volaos Asush Shbagak shbagak.a.mllab.@gmal.com Deparme of Scefc ad Egeerg Smulao, Nagoya Isue of Techology arxv:703.0706v [sa.ml] Mar

More information

The textbook expresses the stock price as the present discounted value of the dividend paid and the price of the stock next period.

The textbook expresses the stock price as the present discounted value of the dividend paid and the price of the stock next period. coomcs 435 Meze. Ch Fall 07 Socal Sceces 748 Uversy of Wscos-Madso Sock rces, News ad he ffce Markes Hypohess The rese Value Model Approach o Asse rcg The exbook expresses he sock prce as he prese dscoued

More information

Some Probability Inequalities for Quadratic Forms of Negatively Dependent Subgaussian Random Variables

Some Probability Inequalities for Quadratic Forms of Negatively Dependent Subgaussian Random Variables Joural of Sceces Islamc epublc of Ira 6(: 63-67 (005 Uvers of ehra ISSN 06-04 hp://scecesuacr Some Probabl Iequales for Quadrac Forms of Negavel Depede Subgaussa adom Varables M Am A ozorga ad H Zare 3

More information

Multiphase Flow Simulation Based on Unstructured Grid

Multiphase Flow Simulation Based on Unstructured Grid 200 Tuoral School o Flud Dyamcs: Topcs Turbulece Uversy of Marylad, May 24-28, 200 Oule Bacgroud Mulphase Flow Smulao Based o Usrucured Grd Bubble Pacg Mehod mehod Based o he Usrucured Grd Remar B CHEN,

More information

θ = θ Π Π Parametric counting process models θ θ θ Log-likelihood: Consider counting processes: Score functions:

θ = θ Π Π Parametric counting process models θ θ θ Log-likelihood: Consider counting processes: Score functions: Paramerc coug process models Cosder coug processes: N,,..., ha cou he occurreces of a eve of eres for dvduals Iesy processes: Lelhood λ ( ;,,..., N { } λ < Log-lelhood: l( log L( Score fucos: U ( l( log

More information

Abstract. Keywords: Mutation probability, evolutionary computation, optimization, sensitivity, variability. 1. Introduction. 2. Proposed Algorithm

Abstract. Keywords: Mutation probability, evolutionary computation, optimization, sensitivity, variability. 1. Introduction. 2. Proposed Algorithm EgOp 2008 Ieraoal Coferece o Egeerg Opmzao Ro de Jaero, Brazl, 01-05 Jue 2008. Absrac Redefg Muao Probables for Evoluoary Opmzao Problems Raja Aggarwal Faculy of Egeerg ad Compuer Scece Cocorda Uversy,

More information

SYRIAN SEISMIC CODE :

SYRIAN SEISMIC CODE : SYRIAN SEISMIC CODE 2004 : Two sac mehods have bee ssued Syra buldg code 2004 o calculae he laeral sesmc forces he buldg. The Frs Sac Mehod: I s he same mehod he prevous code (995) wh few modfcaos. I s

More information

Fault Tolerant Computing. Fault Tolerant Computing CS 530 Probabilistic methods: overview

Fault Tolerant Computing. Fault Tolerant Computing CS 530 Probabilistic methods: overview Probably 1/19/ CS 53 Probablsc mehods: overvew Yashwa K. Malaya Colorado Sae Uversy 1 Probablsc Mehods: Overvew Cocree umbers presece of uceray Probably Dsjo eves Sascal depedece Radom varables ad dsrbuos

More information

Solving Constrained Flow-Shop Scheduling. Problems with Three Machines

Solving Constrained Flow-Shop Scheduling. Problems with Three Machines It J Cotemp Math Sceces, Vol 5, 2010, o 19, 921-929 Solvg Costraed Flow-Shop Schedulg Problems wth Three Maches P Pada ad P Rajedra Departmet of Mathematcs, School of Advaced Sceces, VIT Uversty, Vellore-632

More information

Optimal Reactive Power Dispatching for Automatic Voltage Control of Hydropower Station Based on an Improved Genetic Algorithm

Optimal Reactive Power Dispatching for Automatic Voltage Control of Hydropower Station Based on an Improved Genetic Algorithm Opmal eacve Power Dspachg for Auomac Volage Corol of Hydropower Sao ased o a Improved Geec Algorhm GEHAO SHENG ICHEN JIANG IAOJN SHEN YI ENG Deparme. of Elecrcal Egeerg Shagha Jaoog versy uhu Dsrc, Shagha,

More information

arxiv: v2 [cs.lg] 19 Dec 2016

arxiv: v2 [cs.lg] 19 Dec 2016 1 Sasfcg mul-armed bad problems Paul Reverdy, Vabhav Srvasava, ad Naom Ehrch Leoard arxv:1512.07638v2 [cs.lg] 19 Dec 2016 Absrac Sasfcg s a relaxao of maxmzg ad allows for less rsky decso makg he face

More information

Regression Approach to Parameter Estimation of an Exponential Software Reliability Model

Regression Approach to Parameter Estimation of an Exponential Software Reliability Model Amerca Joural of Theorecal ad Appled Sascs 06; 5(3): 80-86 hp://www.scecepublshggroup.com/j/ajas do: 0.648/j.ajas.060503. ISSN: 36-8999 (Pr); ISSN: 36-9006 (Ole) Regresso Approach o Parameer Esmao of a

More information

A Constitutive Model for Multi-Line Simulation of Granular Material Behavior Using Multi-Plane Pattern

A Constitutive Model for Multi-Line Simulation of Granular Material Behavior Using Multi-Plane Pattern Joural of Compuer Scece 5 (): 8-80, 009 ISSN 549-009 Scece Publcaos A Cosuve Model for Mul-Le Smulao of Graular Maeral Behavor Usg Mul-Plae Paer S.A. Sadread, A. Saed Darya ad M. Zae KN Toos Uversy of

More information

4. THE DENSITY MATRIX

4. THE DENSITY MATRIX 4. THE DENSTY MATRX The desy marx or desy operaor s a alerae represeao of he sae of a quaum sysem for whch we have prevously used he wavefuco. Alhough descrbg a quaum sysem wh he desy marx s equvale o

More information

Human Action Recognition in Smart Classroom

Human Action Recognition in Smart Classroom Huma Aco Recogo Smar Classroom Habg Re Guagyou Xu (Depe of Compuer Scece ad echology, sghua Uversy, Bejg 00084, P.R.Cha) E-mal: rehabg@263.e xgy-dcs@mal.sghua.edu.c Absrac hs paper preses a ew sysem for

More information

ENGINEERING solutions to decision-making problems are

ENGINEERING solutions to decision-making problems are 3788 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 62, NO. 8, AUGUST 2017 Sasfcg Mul-Armed Bad Problems Paul Reverdy, Member, IEEE, Vabhav Srvasava, ad Naom Ehrch Leoard, Fellow, IEEE Absrac Sasfcg s a

More information

Asymptotic Regional Boundary Observer in Distributed Parameter Systems via Sensors Structures

Asymptotic Regional Boundary Observer in Distributed Parameter Systems via Sensors Structures Sesors,, 37-5 sesors ISSN 44-8 by MDPI hp://www.mdp.e/sesors Asympoc Regoal Boudary Observer Dsrbued Parameer Sysems va Sesors Srucures Raheam Al-Saphory Sysems Theory Laboraory, Uversy of Perpga, 5, aveue

More information

EE 6885 Statistical Pattern Recognition

EE 6885 Statistical Pattern Recognition EE 6885 Sascal Paer Recogo Fall 005 Prof. Shh-Fu Chag hp://www.ee.columba.edu/~sfchag Lecure 5 (9//05 4- Readg Model Parameer Esmao ML Esmao, Chap. 3. Mure of Gaussa ad EM Referece Boo, HTF Chap. 8.5 Teboo,

More information

The Properties of Probability of Normal Chain

The Properties of Probability of Normal Chain I. J. Coep. Mah. Sceces Vol. 8 23 o. 9 433-439 HIKARI Ld www.-hkar.co The Properes of Proaly of Noral Cha L Che School of Maheacs ad Sascs Zheghou Noral Uversy Zheghou Cy Hea Provce 4544 Cha cluu6697@sa.co

More information

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1)

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1) Aoucemes Reags o E-reserves Proec roosal ue oay Parameer Esmao Bomercs CSE 9-a Lecure 6 CSE9a Fall 6 CSE9a Fall 6 Paer Classfcao Chaer 3: Mamum-Lelhoo & Bayesa Parameer Esmao ar All maerals hese sles were

More information

D 2 : Decentralized Training over Decentralized Data

D 2 : Decentralized Training over Decentralized Data D : Deceralzed rag over Deceralzed Daa Hal ag Xagru La Mg Ya 3 Ce Zhag 4 J Lu 5 Absrac Whle rag a mache learg model usg mulple workers, each of whch collecs daa from s ow daa source, would be useful whe

More information

Periodic Resource Reallocation in Two-Echelon Repairable Item Inventory Systems

Periodic Resource Reallocation in Two-Echelon Repairable Item Inventory Systems -ASE-009-070 Perodc Resource Reallocao wo-echelo Reparable Iem Iveory Sysems Hoog Chu LAU, Je PAN, Huawe SON Absrac ve a exsg sock allocao a veory sysem, s ofe ecessary o perform reallocao over mulple

More information

NOTE ON SIMPLE AND LOGARITHMIC RETURN

NOTE ON SIMPLE AND LOGARITHMIC RETURN Appled udes Agrbusess ad Commerce AAC Ceer-r ublshg House, Debrece DOI:.94/AAC/27/-2/6 CIENIFIC AE NOE ON IME AND OGAIHMIC EUN aa Mskolcz Uversy of Debrece, Isue of Accoug ad Face mskolczpaa@gmal.com Absrac:

More information

CS344: Introduction to Artificial Intelligence

CS344: Introduction to Artificial Intelligence C344: Iroduco o Arfcal Iellgece Puhpa Bhaacharyya CE Dep. IIT Bombay Lecure 3 3 32 33: Forward ad bacward; Baum elch 9 h ad 2 March ad 2 d Aprl 203 Lecure 27 28 29 were o EM; dae 2 h March o 8 h March

More information

Fresnel Equations cont.

Fresnel Equations cont. Lecure 12 Chaper 4 Fresel quaos co. Toal eral refleco ad evaesce waves Opcal properes of meals Laer: Famlar aspecs of he eraco of lgh ad maer Fresel quaos r 2 Usg Sell s law, we ca re-wre: r s s r a a

More information

New Guaranteed H Performance State Estimation for Delayed Neural Networks

New Guaranteed H Performance State Estimation for Delayed Neural Networks Ieraoal Joural of Iformao ad Elecrocs Egeerg Vol. o. 6 ovember ew Guaraeed H Performace ae Esmao for Delayed eural eworks Wo Il Lee ad PooGyeo Park Absrac I hs paper a ew guaraeed performace sae esmao

More information

An Efficient Dual to Ratio and Product Estimator of Population Variance in Sample Surveys

An Efficient Dual to Ratio and Product Estimator of Population Variance in Sample Surveys "cece as True Here" Joural of Mahemacs ad ascal cece, Volume 06, 78-88 cece gpos Publshg A Effce Dual o Rao ad Produc Esmaor of Populao Varace ample urves ubhash Kumar Yadav Deparme of Mahemacs ad ascs

More information

Synchronization of Complex Network System with Time-Varying Delay Via Periodically Intermittent Control

Synchronization of Complex Network System with Time-Varying Delay Via Periodically Intermittent Control Sychrozao of Complex ework Sysem wh me-varyg Delay Va Perodcally Ierme Corol JIAG Ya Deparme of Elecrcal ad Iformao Egeerg Hua Elecrcal College of echology Xaga 4, Cha Absrac he sychrozao corol problem

More information

Efficient Estimators for Population Variance using Auxiliary Information

Efficient Estimators for Population Variance using Auxiliary Information Global Joural of Mahemacal cece: Theor ad Praccal. IN 97-3 Volume 3, Number (), pp. 39-37 Ieraoal Reearch Publcao Houe hp://www.rphoue.com Effce Emaor for Populao Varace ug Aular Iformao ubhah Kumar Yadav

More information

As evident from the full-sample-model, we continue to assume that individual errors are identically and

As evident from the full-sample-model, we continue to assume that individual errors are identically and Maxmum Lkelhood smao Greee Ch.4; App. R scrp modsa, modsb If we feel safe makg assumpos o he sascal dsrbuo of he error erm, Maxmum Lkelhood smao (ML) s a aracve alerave o Leas Squares for lear regresso

More information

RATIO ESTIMATORS USING CHARACTERISTICS OF POISSON DISTRIBUTION WITH APPLICATION TO EARTHQUAKE DATA

RATIO ESTIMATORS USING CHARACTERISTICS OF POISSON DISTRIBUTION WITH APPLICATION TO EARTHQUAKE DATA The 7 h Ieraoal as of Sascs ad Ecoomcs Prague Sepember 9-0 Absrac RATIO ESTIMATORS USING HARATERISTIS OF POISSON ISTRIBUTION WITH APPLIATION TO EARTHQUAKE ATA Gamze Özel Naural pulaos bolog geecs educao

More information

Available online Journal of Scientific and Engineering Research, 2014, 1(1): Research Article

Available online  Journal of Scientific and Engineering Research, 2014, 1(1): Research Article Avalable ole wwwjsaercom Joural o Scec ad Egeerg Research, 0, ():0-9 Research Arcle ISSN: 39-630 CODEN(USA): JSERBR NEW INFORMATION INEUALITIES ON DIFFERENCE OF GENERALIZED DIVERGENCES AND ITS APPLICATION

More information

CONTROLLABILITY OF A CLASS OF SINGULAR SYSTEMS

CONTROLLABILITY OF A CLASS OF SINGULAR SYSTEMS 44 Asa Joural o Corol Vol 8 No 4 pp 44-43 December 6 -re Paper- CONTROLLAILITY OF A CLASS OF SINGULAR SYSTEMS Guagmg Xe ad Log Wag ASTRACT I hs paper several dere coceps o corollably are vesgaed or a class

More information

Final Exam Applied Econometrics

Final Exam Applied Econometrics Fal Eam Appled Ecoomercs. 0 Sppose we have he followg regresso resl: Depede Varable: SAT Sample: 437 Iclded observaos: 437 Whe heeroskedasc-cosse sadard errors & covarace Varable Coeffce Sd. Error -Sasc

More information

Study on one-dimensional consolidation of soil under cyclic loading and with varied compressibility *

Study on one-dimensional consolidation of soil under cyclic loading and with varied compressibility * Zhuag e al. / J Zhejag v SCI 5 6A(:4-47 4 Joural of Zhejag versy SCIENCE ISSN 9-395 hp://www.ju.edu.c/jus E-mal: jus@ju.edu.c Sudy o oe-dmesoal cosoldao of sol uder cyclc loadg ad wh vared compressbly

More information