Efficient Optimization of Information-Theoretic Exploration in SLAM

Size: px
Start display at page:

Download "Efficient Optimization of Information-Theoretic Exploration in SLAM"

Transcription

1 Proceedings of he Tweny-Third AAAI Conference on Arificial Inelligence (2008) Efficien Opimizaion of Informaion-Theoreic Exploraion in SLAM Thomas Kollar and Nicholas Roy Compuer Science and Arificial Inelligence Laboraory Massachuses Insiue of Technology The Saa Cener, 32 Vassar S. Cambridge, MA {kollar Absrac We presen a novel mehod for informaion-heoreic exploraion, leveraging recen work on mapping and localizaion. We describe exploraion as he consrained opimizaion problem of compuing a rajecory o minimize poserior map error, subjec o he consrains of raveling hrough a se of sensing locaions o ensure map coverage. This rajecory is found by reducing he map o a skeleon graph and searching for a minimum enropy our hrough he graph. We describe how a specific facorizaion of he map covariance allows he reuse of EKF updaes during he opimizaion, giving an efficien gradien ascen search for he maximum informaion gain our hrough sensing locaions, where each our naurally incorporaes revisiing well-known map regions. By generaing incremenally larger ours as he exploraion finds new regions of he environmen, we demonsrae ha our approach can perform auonomous exploraion wih improved accuracy. Inroducion Compuing he bes rajecories for a mobile robo exploring an unknown environmen ypically requires saisfying he wo compeing objecives of compleeness and accuracy. Firsly, he exploraion rajecory mus choose measuremens ha cover he environmen, ensuring ha he robo has colleced measuremens of he enirey of he environmen in order o build a complee map. Secondly, he exploraion rajecory mus choose measuremens ha allow he SLAM algorihm o correcly infer relaionships beween differen pars of he environmen. Generaing he mos accurae map requires maximizing he muual informaion beween differen environmenal feaures and minimizing he loss of of informaion due o he robo moion. Differen pieces of his problem have been examined in previous work (Yamauchi 1997; Feder, Leonard, & Smih 1999; Makarenko e al. 2002; Sachniss & Burgard 2003; Sim & Roy 2005; Sachniss 2006), bu a consisen opimizaion process for addressing hese wo problems remains an open problem. In paricular, here has been lile work on performing mulisep desinaion selecion, primarily due o he fac ha compuing even a sandard minimum-disance rajecory is NP-hard. The problem is furher complicaed by he fac ha he objecive funcions of informaion gain Copyrigh c 2008, Associaion for he Advancemen of Arificial Inelligence ( All righs reserved. and expeced error do no obey he riangle inequaliy, prevening he use of many common opimizaion echniques. In his paper, we describe an algorihm for compuing an exploraion rajecory of an unknown environmen by combining a global opimizaion of a our of environmen waypoins wih a local conroller ha opimizes he rajecory hrough he seleced waypoins. The primary conribuion of our paper is a local-global opimizaion algorihm ha uses hill climbing o opimize a coverage our hrough a parially mapped environmen and a learned conroller ha predics he rajecory beween waypoins ha maximizes he informaion gain. Our second conribuion is o show ha he hill climbing can be performed efficienly using an alernae form of sandard SLAM process-measuremen updaes. SLAM wih he EKF I is beyond he scope of his paper o summarize he sae of he ar in roboic mapping; our overall approach o exploraion is no specific o he map represenaion or inference algorihm, alhough we are able o achieve subsanial gains in efficiency by assuming Gaussian map and robo poseriors. Wihou loss of generaliy, we will resric our discussion o Exended Kalman filer SLAM. Le us denoe he robo posiion a ime as x = (x, y, θ ), and each map feaure or landmark as m i = (x, y). The complee map of n landmarks is described as m = {m 1,...,m n }. When he robo execues a conrol acion u, is nex posiion is given by a process model x +1 = g(x,u, w ) where w is some unknown random noise ha is added o he effec of he conrol. Afer each moion, he robo receives observaions according o is curren posiion and he landmarks. The observaion of landmark i is given by a measuremen model z i = h(x, m i, v ) where v is some unknown random noise ha is added o he received measuremen. Typically a se of observaions z is received a ime. The goal of a SLAM algorihm is o infer he robo pose and he map based on knowledge of he conrol acions and observaions. By filering he acions and observaions, a join disribuion over he robo pose and he map can be mainained, specifically he disribuion p(x,m u 1:,z 1: ). If we assume his disribuion is Gaussian, hen he poserior is paramerized by he mean µ and covariance Σ saisics. If he process and measuremen models are linear funcions and he unobservable noise erms are Gaussian, hen he mean of he poserior disribuion compued 1369

2 Algorihm 1 The Exended Kalman Filer Require: Prior mean and covariance (µ 1, Σ 1 ), conrol u and observaion z. 1: Compue process mean, µ = g(µ 1,u, 0) 2: Compue process Jacobian G µ 1. 3: Compue process covariance, Σ = G Σ 1 G T + R. 4: Compue measuremen Jacobian H µ 1. 5: Compue Kalman gain, ( ) 1. K = Σ H T H Σ H T + Q 6: Compue new mean, µ = µ + K (h(µ, 0) z ). 7: Compue new covariance, Σ T = (I K T H T )Σ T. by he Kalman filer can be shown o minimize he leassquares error. When he models are non-linear, he exended Kalman filer can be used o repeaedly linearize he process and measuremen models. The Jacobians of he process and measuremen models g and h used o linearize hese models are G and H respecively, and R and Q are he covariances of he Gaussian noise disribuions for he process and measuremen updaes. The complee Exended Kalman Filer algorihm is shown in figure 1. Exploraion Given sufficien daa, he mean esimae of he map provided by he EKF will usually converge o he rue map 1. Unforunaely, few if any guaranees are given for he rae a which he map converges; he qualiy of he map is highly dependen on he robo rajecory as well as a variey of oher feaures such as sensor model, feaure disinciveness for daa associaion, ec.. The goal in his paper is o choose rajecories ha lead o sensor daa ha resuls in he bes map, maximizing boh he map coverage and he map accuracy. There are differen ways o reconcile hese objecives, such as opimizing he robo rajecory wih respec o an overall objecive ha is a weighed sum of individual erms (Makarenko e al. 2002), bu such approaches raise addiional quesions such as how o choose he individual weighs. Our approach is o define he coverage problem as choosing he bes sensing locaions ha maximize he likelihood he complee environmen will be observed. If hese sensing locaions serve as consrains on a rajecory planner, and we compue a rajecory ha maximizes he map accuracy subjec o hese consrains, hen we have a principled way o opimize boh coverage and accuracy. The rajecory consrains based on coverage can be calculaed firs, and hen he consrained rajecory can be calculaed o maximize map accuracy. 1 Alhough, convergence guaranees only hold for he sandard Kalman filer, no he exended form. Map Coverage Le us firs consider he problem of coverage. We define a visibiliy funcion δ(m i,z 0:T ), δ(m i 1 if map feaure mi appears a leas once in he observaions,z 0:T )= z 0:T, (1) 0 oherwise. The goal of a coverage planner is o search for he rajecory ha maximizes he number of sensed feaures, argmax z 0:T m δ(m i,z 0:T ). (2) i=1 We canno conrol or predic a priori wha observaions z will be received. We can conrol he robo poses x 0:T, bu he observaions are generaed sochasically according o he map and he robo pose, p(z 0:T x 0:T,m). We do no know he map m, so equaion (2) mus be compued in expecaion over he map prior disribuion p(m) and observaion likelihoods, such ha argmaxe p(m) E p(z0:t x 0:T,m) x 0:T m δ(m i,z 0:T ). (3) i=1 The disribuion p(m) is acually condiioned on he measuremens up o he curren ime, which have been lef ou for breviy. Compuing he double expecaion exacly over boh map prior and observaions is compuaionally infeasible, bu he disribuion over maps has only limied impac on he opimizaion. A he beginning of he mapping process, he map prior will be very uncerain, which will cause all rajecories o have approximaely he same expeced performance, as all maps are equally likely. As more observaions are made, he map disribuion will begin o converge o he rue map, and he mean map esimae ˆm will become an increasingly good approximaion o he complee disribuion. When he map prior is informaive, Mone Carlo mehods may be used o esimae he ouer expecaion direcly. Thus, we can approximae equaion (3) as: argmax x 0:T E p(z0:t x 0:T, ˆm) m δ(m i,z 0:T ). (4) i=1 While equaion (4) is an approximaion o our rue coverage objecive, in pracice a robo coverage algorihm ha chooses a se of desinaion poins o maximize he likelihood of compleing he map based on he curren esimae, and hen follows a our hrough hese poins will have performance comparable o any oher sequenial decision making algorihm. We can gain addiional compuaional racabiliy in solving equaion (4) by considering only a subse of possible locaions of he robo, ha is, a reracion of he map free space. Map feaures are consrained (by definiion) o lie on he boundary of free space, so our reracion mus be a subse of map locaions ha guaranee ha every boundary poin can be observed from some poin in his subse. This propery of boundary visibiliy is known as reliabiliy (Cornea, Silver, & Min 2005); using he medial-axis 1370

3 (a) Original (b) Median Filered (c) Skeleon (d) Sensing Waypoins Figure 1: The map skeleonizaion process. (a) The binarized map. (b) The smoohed map using median filering o eliminae high-frequency noise. (c) The piece-wise linear graph of he medial-axis skeleon. (d) The graph nodes ha consiue sensing locaion consrains for he rajecory opimizaion. Algorihm 2 Map Skeleonizaion Require: Map m. 1: Median filer he binarized map. 2: Skeleonize he map using he medial-axis ransform. 3: Reduce he skeleon o a piece-wise linear graph. 4: Reurn he end-poins of lines (juncions and graph leaves). ransform (Blum 1967) o reduce he free-space o a minimal skeleon allows us o approximae he map wih a minimal se of poins while guaraneeing reliabiliy necessary for equaion (4). Efficien algorihms for compuing a medial axis represenaion ha guaranee reliabiliy generally lead o near-opimal reducions in erms of minimizing he resuling skeleon; compuing he rue minimum medial axis represenaion is NP-hard (Coeurjolly, Hulin, & Sivignon 2007). In order o selec a se of sensing locaions ha provide a guaranee of coverage, we rerac he map o is medial axis. We hen furher reduce he skeleon o a graph by convering he resuling map skeleon o a piece-wise linear represenaion, reaining only he end-poins of he resuling lines. We can hen use graph search algorihms o idenify a our of he skeleon ha maximizes he prediced map accuracy. By generaing a graph wih guaraneed boundary visibiliy and consraining our exploraion rajecory o raverse he graph nodes, he exploraion rajecory is consrained o have full map coverage. The complee skeleonizaion algorihm is given by Algorihm 2. Alhough he map free space is given by an occupancy grid, his grid is binarized before skeleonizaion. Noe also ha high frequency objecs in he map such as discreizaion arifacs or small mapping errors can creae an overabundance of end-poins and juncion poins. To reduce he effec of high-frequency areas o have less influence on he end-poins and juncion poins, we firs median filer he image, as shown in figure 1(b). Map Accuracy Given a graph hrough he environmen ha maximizes he expeced coverage of map feaures, we wish o compue a our hrough hese poins, ha is, an ordering on he graph poins, o maximize he resuling map accuracy. We canno measure he map accuracy wihou ground ruh, bu he covariance of he poserior disribuion provides an esimae of he expeced squared error. Minimizing he deerminan of he covariance of he poserior Gaussian is equivalen o minimizing he differenial enropy described by Darbellay & Vajda (2000) as H(p(x)) = 1 2 ln[(2πe)n Σ ] As a resul, compuing a our ha minimizes he poserior map enropy is equivalen o an informaion-heoreic raveling salesman problem where he objecive funcion is informaion maximizaion raher han disance minimizaion. Unforunaely, as wih mos raveling salesmen problems, his opimizaion is again NP-hard (Garey & Johnson 1979). Furhermore, he marginal gains for compuing he globally opimal rajecory are likely o be low. On he oher hand, local opimizaion will allow he planner o ake advanage of some informaion gahering acions such as revisiing known-locaions ha a greedy sraegy will no always use. As a resul, we use local opimizaion similar o he gradien-ascen heurisics used for solving shores-pah TSP problems, such as he edge-swapping heurisic. A basic our is generaed from a simple greedy neares-neighbor heurisic. A each ieraion wo edges are swapped and he informaion gain is calculaed. If he poserior map enropy is improved, hen his edge move is reained. Edge-swaps are coninued unil no more edges are available o reduce he our cos, and he bes (lowes prediced poserior enropy) our is used. Noe ha for he convenional shores-pah TSP, his approach is referred o as he 2-op heurisic and is known o be beer han a O(log(n)) approximaion o he rue disanceopimal soluion. This performance guaranee does no hold 1371

4 (a) Shores Pah Opimizaion (b) Informaion-Theoreic Opimizaion Figure 2: Opimized ours. (a) For he graph in figure 1, he ordering of poins ha minimizes he oal pah lengh. (H(p(m)) = 6.93) ((b) For he same graph, he ordering of poins found by minimizing he poserior map enropy. (H(p(m)) = 7.23). (Noe ha he red lines only show conneciviy for clariy of presenaion. The pah lenghs are calculaed hrough free space, bu he pahs end o overlap and obscure he resuling conneciviy.) (a) A our using a shores-pah conroller (b) The same our using he learned conroller Figure 3: Comparison of a pah execued by a convenional shores-pah conroller ha uses poin urns (a) wih a pah execued by he learned conroller. The our of he graph in boh cases was fixed. for he exploraion problem because informaion gain violaes he riangle inequaliy propery (Rosenkranz, Searns, & Lewis 1977), bu performs well in pracice. Figure 2 shows he resuls of using he 2-op heurisic wih he shores pah objecive (figure 2a) and he informaionheoreic objecive (figure 2b). The principal difference beween he planned ours is he order of poins a he op of he free space; by firs aking a sraigh rajecory ino he room, he informaion-heoreic opimizaion improves he map of he room and hen revisis he remaining locaions, using he improved map o say well-localized. Addiionally, he informaion-heoreic opimizaion uses waypoins in previously-visied locaions o relocalize. In conras, he shores-pah rajecory visis he early waypoins firs and canno deliberaely revisi a locaion o re-localize, resuling in a lower-qualiy map. The informaion-heoreic opimizaion significanly reduces he map enropy, and as expeced also reduces he squared error of he resuling map (able 1). One-sep updae In order o evaluae he poserior map enropy of a paricular exploraion rajecory, we can use he EKF algorihm o predic he poserior map esimae afer he rajecory. This predicion requires compuing he expeced moion updaes, simulaing measuremens from he curren map esimae and updaing he EKF a each ime sep. As a resul, he predicion process can be compuaionally demanding and he cos is inensified by he fac ha every modificaion of he exploraion rajecory requires he EKF o recalculae he map for he enire rajecory following he modificaion. Even if he rajecory modificaion is relaively minor, he non-lineariy of he covariance updae in Algorihm 1 means ha he EKF updaes from one rajecory canno easily be re-used in he overlapping secion of anoher rajecory. Noe ha he simulaed moions and measuremens can be re-used if he same graph edge appears in wo poenial rajecories; i is only he EKF updaes along ha mus be recalculaed. In previous work (Prenice & Roy 2007) we have shown ha muliple updaes o a Gaussian filer can be combined ino a single ransfer funcion by re-facoring he covariance, allowing muliple process and measuremen updaes o occur wih a consan number of operaions. If he covariance is facored as Σ 1 = B 1 C 1, hen he individual facors can be updaed as [ [ ] [ B W X B C] = C] (5) Y Z [ ] 0 I = I M 1 [ ] 0 G T G RG T [ B C], (6) 1 and Σ recovered as Σ = B C 1. (Noe ha M = H T Q 1 H, he measuremen informaion.) Assuming we have known measuremens and moions, we recursively apply he measuremen Jacobian H, he measuremen noise Q, he moion Jacobian G, and he moion noise R in he form of he ransfer funcion ζ = [ W X Y Z ] o he original [ ] B. C 0 We can iniialize he process as B 0 = Σ 0 and C 0 = I, where Σ 0 is he iniial covariance marix and I, he ideniy marix. Noe ha each of hese erms can be seen in he original formulaion of he EKF (Algorihm 1). Using his one-sep EKF updae, we can collec simulaed moions and measuremens along each edge in he graph, compue he relevan Jacobians for each ime sep and collec erms ino a single ransfer funcion for ha edge ha can be reused for he map poserior predicion of fuure exploraion rajecories. In paricular, we compue a our by creaing one-sep ransfer funcions beween he ineres poins. If we have already creaed a ransfer funcion o wihin some hreshold ǫ of he original orienaion, we will use he pregeneraed ransfer funcion o obain he one-sep updae. Thus, we will cache he ransfer funcions for any alreadyvisied edges and use his o drasically speed up he search process. Moion Conrol Opimizaion The coverage exploraion opimizaion assumes a specific model of robo moion along each edge in he graph bu 1372

5 Algorihm 3 Informaion-heoreic our opimizaion Require: Approximae shores-pah TSP our bes hrough sensing locaions derived from skeleonized map 1: while no converged do 2: Use he 2-op heurisic o creae a new our curr, iniialize Σ 0. 3: for each edge e ij in curr do 4: if Edge e ij was previously execued saring from orienaion θ k hen 5: Use he cached ransfer funcion ζ ijk o updae Σ o Σ +n. 6: else if e ij has no been execued before hen 7: Execue i using he learned conroller and sore ζ ijk for ha edge and iniial orienaion. 8: end if 9: end for 10: Compue he enropy H curr = enropy(σ T ) afer execuing our curr. 11: if H curr < H bes hen 12: bes = curr. 13: end if 14: end while 15: reurn bes does no have direc conrol of he robo moion beween sensing locaions. However, he rajecory beween sensor measuremens can have a significan effec on he accuracy of he map. When he robo moion is noisy, subsanial uncerainy is inroduced ino he robo pose; when few or no observaions are expeced (e.g., because of sensor range limis or environmenal sparsiy), his uncerainy is propagaed hroughou he SLAM filer ino fuure measuremens, increasing he expeced error of he poserior. To learn good conrol policies, we use Policy Search Dynamic Programming (Bagnell e al. 2003; Kollar & Roy 2008), a form of reinforcemen learning ha decomposes ino a series of one-sep learning algorihms, essenially urning he learning of sequenial decision making ino dynamic programming. The algorihm operaes by firs learning a one-sep policy a ime T (e.g., he end ime). Training daa is generaed by sampling a robo pose and map disribuion p(x,m), and using all possible conrol rajecories a o propagae each sampled prior disribuion forward o a poserior disribuion according o he EKF using a physically realisic vehicle and sensor simulaor. Each disribuionacion pair is labeled wih he resuling informaion gain, and he disribuion-acion pair ha minimizes he map enropy is kep as a raining insance for a supervised learning problem a ime. The resul is a classifier ha provides an acion a for any disribuion p(s) ha minimizes he map enropy. A one-sep policy for he previous ime sep T 1 is hen learned by sampling disribuions and acions as before, bu propagaing each disribuion according o each acion a followed by he ime T one-sep policy previously obained. A wo-sep enropy is obained for each disribuion and acion a T 1 from he resuling disribuion-acion pair: he enropy associaed wih each disribuion and acion is accumulaed from he moion and observaions associaed wih he conrol a ime T 1 and he moion and observaions received by running he learned policy for ime T. A new policy is learned for ime T 1, and he learner hen ieraes a each ime hrough o ime T using he policies π i for i {...T}, resuling in a moion conroller ha minimizes he enropy over he lengh of he rajecory = 0...T. Figure 3 compares a sandard shores-pah moion conroller wih he learned conroller for he same our of sensing locaions. As expeced, he map enropy is significanly reduced for he learned conroller (H(m) = 7.23) compared o he shores-pah conroller (H(m) = 6.93). Incremenal Exploraion To summarize, our goal has been o opimally explore unknown environmens by solving a local-global opimizaion problem, maximizing boh map coverage and map accuracy. The coverage requiremen has led us o ake he parial map, skeleonize i, and exrac sensing locaions from i, as seen in Algorihm 2. The map accuracy requiremen has led us o a hill-climbing algorihm ha opimizes he visiaion of sensing locaions while a he same ime minimizing he resuling enropy of he map. The iniial our is opimized wih respec o disance, he 2-op heurisic modifies he our and he bes ours are kep, as seen in Algorihm 3. Compuing he EKF updaes for previously visied our edges is inefficien and can be opimized by soring a ransfer funcion for each edge. Addiional improvemen can be achieved by using a local learned conroller o plan local rajecories beween sensing locaions. Once he opimal our is compued and hen execued by he robo, he parial map of he environmen will grow. When a complee our is accomplished, a new our will be compued for he new parial map and he nex round of exploraion will begin. In his secion, we have presened a complee algorihm for informaionheoreic exploraion and have elaboraed a number of ways ha i can be made more efficien. Resuls To demonsrae he effeciveness of he approach, we esed he exploraion algorihm on scenarios from he Radish daase using a simulaion wih uncerain robo moion and uncerain landmark measuremens. Treaing boh he WEAN and LONGWOOD maps as parial maps, we ran he opimizaion wih he shores pah conroller and wih he learned conroller. Overall, i is clear ha he map enropy afer exploraion has been reduced, and in all bu he one insance (he WEAN map wih he informaion-heoreic objecive and he learned conroller), he squared error is also reduced for he opimized rajecory. In he larger LONGWOOD example (Figure 4), i appears clear ha he our says in sigh of as many map feaures (red squares) as possible. Having shown ha we can successfully reduce he map enropy and improve map accuracy, we nex demonsrae an incremenal exploraion of he environmen in Figure 5. Here he robo has a parial map of he environmen, skeleonizes his map o find sensing locaions, compues an informaion-heoreic our, and hen execues ha our. Afer he execuion of he our in (d), he robo hen compues a new our, which i hen execues in (e), exploring his parial map as well unil he enire environmen has been observed (f). Table 2 gives a comparison of he enropy and map accuracies for he maps in Figure 5 corresponding o he disance opimal and informaion-heoreic exploraion algorihms. Noing ha each pair of maps (a-d, b-e, c-f) may be direcly compared, we found ha he squared error and he en- 1373

6 (a) Disance Opimal Tour (b) Disance Opimal Pah (c) Informaion-heoreic Tour (d) Informaion-heoreic Pah Figure 4: Tours for he Longwood map. (a) The shores pah ordering of he skeleon map graph. (b) The resuling pah using he shorespah conroller. (c) The minimum-enropy pah ordering of he skeleon map graph. (d) The resuling pah using he learned conroller. MAP OPT. TYPE CONT. ENT. SE DIST. W Pah Lgh. Dis. Op m W Info Dis. Op m W Info Learned m L Pah Lgh. Learned m L Info Learned m Table 1: Algorihm Performance for a Single Tour. MAP is he map ha he experimen was performed in, W is he WEAN map and L is he LONGWOOD map. OPT. TYPE corresponds o graph our opimizaion, eiher shores pah or informaion-heoreic opimizaion. CONT. corresponds o he moion conroller, eiher shores pah or learned. ENT. and SE correspond o he merics of map enropy and map squared error compared o he known ground ruh respecively. DIST. corresponds o he disance he robo raveled over he enire our. ropy were boh significanly reduced by he opimized policy. Furher, by using one-sep ransfer funcions, we were able o reduce he oal number of edge predicion seps by 50%, for a similar ime savings of 50%. We expec his savings will grow for larger problem sizes. Conclusion We have presened a novel mehod ha frames exploraion as a consrained opimizaion problem. The algorihm reduces he curren environmenal map o a graph of he map skele- OPT. CONT. ENT. SE DIST (a) Dis Learned m (b) Dis Learned m (c) Dis Learned m (d) Info Learned m (e) Info Learned m (f) Info Learned m Table 2: Algorihm for a sequence of ours during online exploraion. OPT. corresponds o graph our opimizaion, eiher shores pah or informaion-heoreic opimizaion. The conroller CONT. in every case was he learned conroller. ENT. and SE correspond o he merics of map enropy and map squared error respecively. DIST. corresponds o he disance he robo raveled over he enire our. The informaion-heoreic our opimizaion resuls in boh a reduced map enropy and reduced map error compared o he known ground ruh. on, placing sensing consrains a he graph nodes which naurally include boundaries and froniers. Informaionheoreic ours are hen generaed hrough he sensing consrains using a gradien-ascen wih a heurisic; we can perform he opimizaion efficienly by using one-sep ransfer funcions on he covariance o re-use previously compued EKF updaes. To our knowledge his is one of few informaion-heoreic approaches o exploraion and he only one o consider mulisep ours of an environmen. Finally, a learner is employed o opimize he local ra- 1374

7 (a) (b) (c) (a-c) Map exploraion using shores pah opimizaion of exploraion rajecory (d) (e) (f) (d-f) Map exploraion using informaion-heoreic opimizaion of exploraion rajecory Figure 5: Example exploraion rajecories, comparing a shores-pah coverage rajecory wih an informaion-heoreic coverage rajecory. In panels (a) and (d), he algorihm has only a limied view of he environmen, and plans a our hrough he visible free space. In doing so, he map is expanded (b and e), and he robo can plan a second our hrough he larger space. The process ieraes again hrough a ye larger map in (c and f). jecory of he robo beween sensing consrains, o obain global informaion-heoreic exploraion ha can quickly find informaion-heoreic rajecories hrough an environmen, resuling in a high qualiy map. References Bagnell, J.; Kakade, S.; Ng, A.; and Schneider, J Policy search by dynamic programming. In Neural Informaion Processing Sysems, volume 16. MIT Press. Blum, H A Transformaion for Exracion New Descripors of Shape, Models for he Percepion of Speech and Visual Form. Cambridge: MIT Press. Coeurjolly, D.; Hulin, J.; and Sivignon, I Finding a Minimum Medial Axis of a Discree Shape is NPhard. Technical Repor RR-LIRIS , LIRIS UMR 5205 CNRS/INSA de Lyon/Universi Claude Bernard Lyon 1/Universi Lumire Lyon 2/Ecole Cenrale de Lyon. Cornea, N. D.; Silver, D.; and Min, P Curveskeleon applicaions. In Proceedings IEEE Visualizaion, Darbellay, G. A., and Vajda, I Enropy expressions for mulivariae coninuous disribuions. IEEE Transacions on Informaion Theory 46(2): Feder, H. J. S.; Leonard, J. J.; and Smih, C. M Adapive mobile robo navigaion and mapping. Inernaional Journal of Roboics Research, Special Issue on Field and Service Roboics 18(7): Garey, M. R., and Johnson, D. S Compuers and Inracabiliy : A Guide o he Theory of NP-Compleeness. Series of Books in he Mahemaical Sciences. W. H. Freeman. Kollar, T., and Roy, N Trajecory opimizaion using reinforcemen learning for map exploraion. The Inernaional Journal of Roboics Research 27(2): Makarenko, A.; Williams, S.; Bourgaul, F.; and Durran- Whye, H An experimen in inegraed exploraion. Proc. of he IEEE/RSJ Inl. Conf. on Inelligen Robos and Sysems (IROS). Prenice, S., and Roy, N The belief roadmap: Efficien planning in linear pomdps by facoring he covariance. In Proceedings of he 13h Inernaional Symposium of Roboics Research (ISRR). Rosenkranz, D.; Searns, R.; and Lewis, P An analysis of several heurisics for he raveling salesman problem. SIAM Journal on Compuing 6:563. Sim, R., and Roy, N Global a-opimal robo exploraion in slam. In Proceedings of he IEEE Inernaional Conference on Roboics and Auomaion (ICRA). Sachniss, C., and Burgard, W Exploring unknown environmens wih mobile robos using coverage maps. In Proc. of he Inernaional Conference on Arificial Inelligence (IJCAI). Sachniss, C Exploraion and Mapping wih Mobile Robos. Ph.D. Disseraion, Universiy of Freiburg, Deparmen of Compuer Science. Yamauchi, B A fronier-based approach for auonomous exploraion. Proceedings of he IEEE Inernaional Symposium on Compuaional Inelligence in Roboics and Auomaion

Probabilistic Robotics SLAM

Probabilistic Robotics SLAM Probabilisic Roboics SLAM The SLAM Problem SLAM is he process by which a robo builds a map of he environmen and, a he same ime, uses his map o compue is locaion Localizaion: inferring locaion given a map

More information

L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS. NA568 Mobile Robotics: Methods & Algorithms

L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS. NA568 Mobile Robotics: Methods & Algorithms L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS NA568 Mobile Roboics: Mehods & Algorihms Today s Topic Quick review on (Linear) Kalman Filer Kalman Filering for Non-Linear Sysems Exended Kalman Filer (EKF)

More information

Probabilistic Robotics SLAM

Probabilistic Robotics SLAM Probabilisic Roboics SLAM The SLAM Problem SLAM is he process by which a robo builds a map of he environmen and, a he same ime, uses his map o compue is locaion Localizaion: inferring locaion given a map

More information

Sequential Importance Resampling (SIR) Particle Filter

Sequential Importance Resampling (SIR) Particle Filter Paricle Filers++ Pieer Abbeel UC Berkeley EECS Many slides adaped from Thrun, Burgard and Fox, Probabilisic Roboics 1. Algorihm paricle_filer( S -1, u, z ): 2. Sequenial Imporance Resampling (SIR) Paricle

More information

Probabilistic Robotics

Probabilistic Robotics Probabilisic Roboics Bayes Filer Implemenaions Gaussian filers Bayes Filer Reminder Predicion bel p u bel d Correcion bel η p z bel Gaussians : ~ π e p N p - Univariae / / : ~ μ μ μ e p Ν p d π Mulivariae

More information

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II Roland Siegwar Margaria Chli Paul Furgale Marco Huer Marin Rufli Davide Scaramuzza ETH Maser Course: 151-0854-00L Auonomous Mobile Robos Localizaion II ACT and SEE For all do, (predicion updae / ACT),

More information

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017 Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =

More information

Introduction to Mobile Robotics

Introduction to Mobile Robotics Inroducion o Mobile Roboics Bayes Filer Kalman Filer Wolfram Burgard Cyrill Sachniss Giorgio Grisei Maren Bennewiz Chrisian Plagemann Bayes Filer Reminder Predicion bel p u bel d Correcion bel η p z bel

More information

Planning in POMDPs. Dominik Schoenberger Abstract

Planning in POMDPs. Dominik Schoenberger Abstract Planning in POMDPs Dominik Schoenberger d.schoenberger@sud.u-darmsad.de Absrac This documen briefly explains wha a Parially Observable Markov Decision Process is. Furhermore i inroduces he differen approaches

More information

2016 Possible Examination Questions. Robotics CSCE 574

2016 Possible Examination Questions. Robotics CSCE 574 206 Possible Examinaion Quesions Roboics CSCE 574 ) Wha are he differences beween Hydraulic drive and Shape Memory Alloy drive? Name one applicaion in which each one of hem is appropriae. 2) Wha are he

More information

Notes on Kalman Filtering

Notes on Kalman Filtering Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren

More information

EKF SLAM vs. FastSLAM A Comparison

EKF SLAM vs. FastSLAM A Comparison vs. A Comparison Michael Calonder, Compuer Vision Lab Swiss Federal Insiue of Technology, Lausanne EPFL) michael.calonder@epfl.ch The wo algorihms are described wih a planar robo applicaion in mind. Generalizaion

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

Robot Motion Model EKF based Localization EKF SLAM Graph SLAM

Robot Motion Model EKF based Localization EKF SLAM Graph SLAM Robo Moion Model EKF based Localizaion EKF SLAM Graph SLAM General Robo Moion Model Robo sae v r Conrol a ime Sae updae model Noise model of robo conrol Noise model of conrol Robo moion model

More information

Introduction to Mobile Robotics SLAM: Simultaneous Localization and Mapping

Introduction to Mobile Robotics SLAM: Simultaneous Localization and Mapping Inroducion o Mobile Roboics SLAM: Simulaneous Localizaion and Mapping Wolfram Burgard, Maren Bennewiz, Diego Tipaldi, Luciano Spinello Wha is SLAM? Esimae he pose of a robo and he map of he environmen

More information

Probabilistic Robotics The Sparse Extended Information Filter

Probabilistic Robotics The Sparse Extended Information Filter Probabilisic Roboics The Sparse Exended Informaion Filer MSc course Arificial Inelligence 2018 hps://saff.fnwi.uva.nl/a.visser/educaion/probabilisicroboics/ Arnoud Visser Inelligen Roboics Lab Informaics

More information

Announcements. Recap: Filtering. Recap: Reasoning Over Time. Example: State Representations for Robot Localization. Particle Filtering

Announcements. Recap: Filtering. Recap: Reasoning Over Time. Example: State Representations for Robot Localization. Particle Filtering Inroducion o Arificial Inelligence V22.0472-001 Fall 2009 Lecure 18: aricle & Kalman Filering Announcemens Final exam will be a 7pm on Wednesday December 14 h Dae of las class 1.5 hrs long I won ask anyhing

More information

Estimation of Poses with Particle Filters

Estimation of Poses with Particle Filters Esimaion of Poses wih Paricle Filers Dr.-Ing. Bernd Ludwig Chair for Arificial Inelligence Deparmen of Compuer Science Friedrich-Alexander-Universiä Erlangen-Nürnberg 12/05/2008 Dr.-Ing. Bernd Ludwig (FAU

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

SEIF, EnKF, EKF SLAM. Pieter Abbeel UC Berkeley EECS

SEIF, EnKF, EKF SLAM. Pieter Abbeel UC Berkeley EECS SEIF, EnKF, EKF SLAM Pieer Abbeel UC Berkeley EECS Informaion Filer From an analyical poin of view == Kalman filer Difference: keep rack of he inverse covariance raher han he covariance marix [maer of

More information

A Hop Constrained Min-Sum Arborescence with Outage Costs

A Hop Constrained Min-Sum Arborescence with Outage Costs A Hop Consrained Min-Sum Arborescence wih Ouage Coss Rakesh Kawara Minnesoa Sae Universiy, Mankao, MN 56001 Email: Kawara@mnsu.edu Absrac The hop consrained min-sum arborescence wih ouage coss problem

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

Ensamble methods: Boosting

Ensamble methods: Boosting Lecure 21 Ensamble mehods: Boosing Milos Hauskrech milos@cs.pi.edu 5329 Senno Square Schedule Final exam: April 18: 1:00-2:15pm, in-class Term projecs April 23 & April 25: a 1:00-2:30pm in CS seminar room

More information

2.160 System Identification, Estimation, and Learning. Lecture Notes No. 8. March 6, 2006

2.160 System Identification, Estimation, and Learning. Lecture Notes No. 8. March 6, 2006 2.160 Sysem Idenificaion, Esimaion, and Learning Lecure Noes No. 8 March 6, 2006 4.9 Eended Kalman Filer In many pracical problems, he process dynamics are nonlinear. w Process Dynamics v y u Model (Linearized)

More information

Ensamble methods: Bagging and Boosting

Ensamble methods: Bagging and Boosting Lecure 21 Ensamble mehods: Bagging and Boosing Milos Hauskrech milos@cs.pi.edu 5329 Senno Square Ensemble mehods Mixure of expers Muliple base models (classifiers, regressors), each covers a differen par

More information

Overview. COMP14112: Artificial Intelligence Fundamentals. Lecture 0 Very Brief Overview. Structure of this course

Overview. COMP14112: Artificial Intelligence Fundamentals. Lecture 0 Very Brief Overview. Structure of this course OMP: Arificial Inelligence Fundamenals Lecure 0 Very Brief Overview Lecurer: Email: Xiao-Jun Zeng x.zeng@mancheser.ac.uk Overview This course will focus mainly on probabilisic mehods in AI We shall presen

More information

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon 3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of

More information

Tom Heskes and Onno Zoeter. Presented by Mark Buller

Tom Heskes and Onno Zoeter. Presented by Mark Buller Tom Heskes and Onno Zoeer Presened by Mark Buller Dynamic Bayesian Neworks Direced graphical models of sochasic processes Represen hidden and observed variables wih differen dependencies Generalize Hidden

More information

Article from. Predictive Analytics and Futurism. July 2016 Issue 13

Article from. Predictive Analytics and Futurism. July 2016 Issue 13 Aricle from Predicive Analyics and Fuurism July 6 Issue An Inroducion o Incremenal Learning By Qiang Wu and Dave Snell Machine learning provides useful ools for predicive analyics The ypical machine learning

More information

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

Using the Kalman filter Extended Kalman filter

Using the Kalman filter Extended Kalman filter Using he Kalman filer Eended Kalman filer Doz. G. Bleser Prof. Sricker Compuer Vision: Objec and People Tracking SA- Ouline Recap: Kalman filer algorihm Using Kalman filers Eended Kalman filer algorihm

More information

CHAPTER 10 VALIDATION OF TEST WITH ARTIFICAL NEURAL NETWORK

CHAPTER 10 VALIDATION OF TEST WITH ARTIFICAL NEURAL NETWORK 175 CHAPTER 10 VALIDATION OF TEST WITH ARTIFICAL NEURAL NETWORK 10.1 INTRODUCTION Amongs he research work performed, he bes resuls of experimenal work are validaed wih Arificial Neural Nework. From he

More information

Online Convex Optimization Example And Follow-The-Leader

Online Convex Optimization Example And Follow-The-Leader CSE599s, Spring 2014, Online Learning Lecure 2-04/03/2014 Online Convex Opimizaion Example And Follow-The-Leader Lecurer: Brendan McMahan Scribe: Sephen Joe Jonany 1 Review of Online Convex Opimizaion

More information

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN The MIT Press, 2014 Lecure Slides for INTRODUCTION TO MACHINE LEARNING 3RD EDITION alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/~ehem/i2ml3e CHAPTER 2: SUPERVISED LEARNING Learning a Class

More information

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model Modal idenificaion of srucures from roving inpu daa by means of maximum likelihood esimaion of he sae space model J. Cara, J. Juan, E. Alarcón Absrac The usual way o perform a forced vibraion es is o fix

More information

Fixed-lag Sampling Strategies for Particle Filtering SLAM

Fixed-lag Sampling Strategies for Particle Filtering SLAM To appear in he 7 IEEE Inernaional Conference on Roboics & Auomaion (ICRA 7) Fixed-lag Sampling Sraegies for Paricle Filering SLAM Krisopher R. Beevers and Wesley H. Huang Absrac We describe wo new sampling

More information

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal

More information

Particle Swarm Optimization Combining Diversification and Intensification for Nonlinear Integer Programming Problems

Particle Swarm Optimization Combining Diversification and Intensification for Nonlinear Integer Programming Problems Paricle Swarm Opimizaion Combining Diversificaion and Inensificaion for Nonlinear Ineger Programming Problems Takeshi Masui, Masaoshi Sakawa, Kosuke Kao and Koichi Masumoo Hiroshima Universiy 1-4-1, Kagamiyama,

More information

Planning in Information Space for a Quadrotor Helicopter in a GPS-denied Environment

Planning in Information Space for a Quadrotor Helicopter in a GPS-denied Environment Planning in Informaion Space for a Quadroor Helicoper in a GPS-denied Environmen Ruijie He, Sam Prenice and Nicholas Roy Absrac This paper describes a moion planning algorihm for a quadroor helicoper flying

More information

Air Traffic Forecast Empirical Research Based on the MCMC Method

Air Traffic Forecast Empirical Research Based on the MCMC Method Compuer and Informaion Science; Vol. 5, No. 5; 0 ISSN 93-8989 E-ISSN 93-8997 Published by Canadian Cener of Science and Educaion Air Traffic Forecas Empirical Research Based on he MCMC Mehod Jian-bo Wang,

More information

Augmented Reality II - Kalman Filters - Gudrun Klinker May 25, 2004

Augmented Reality II - Kalman Filters - Gudrun Klinker May 25, 2004 Augmened Realiy II Kalman Filers Gudrun Klinker May 25, 2004 Ouline Moivaion Discree Kalman Filer Modeled Process Compuing Model Parameers Algorihm Exended Kalman Filer Kalman Filer for Sensor Fusion Lieraure

More information

Georey E. Hinton. University oftoronto. Technical Report CRG-TR February 22, Abstract

Georey E. Hinton. University oftoronto.   Technical Report CRG-TR February 22, Abstract Parameer Esimaion for Linear Dynamical Sysems Zoubin Ghahramani Georey E. Hinon Deparmen of Compuer Science Universiy oftorono 6 King's College Road Torono, Canada M5S A4 Email: zoubin@cs.orono.edu Technical

More information

Speaker Adaptation Techniques For Continuous Speech Using Medium and Small Adaptation Data Sets. Constantinos Boulis

Speaker Adaptation Techniques For Continuous Speech Using Medium and Small Adaptation Data Sets. Constantinos Boulis Speaker Adapaion Techniques For Coninuous Speech Using Medium and Small Adapaion Daa Ses Consaninos Boulis Ouline of he Presenaion Inroducion o he speaker adapaion problem Maximum Likelihood Sochasic Transformaions

More information

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Simulaion-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Week Descripion Reading Maerial 2 Compuer Simulaion of Dynamic Models Finie Difference, coninuous saes, discree ime Simple Mehods Euler Trapezoid

More information

RL Lecture 7: Eligibility Traces. R. S. Sutton and A. G. Barto: Reinforcement Learning: An Introduction 1

RL Lecture 7: Eligibility Traces. R. S. Sutton and A. G. Barto: Reinforcement Learning: An Introduction 1 RL Lecure 7: Eligibiliy Traces R. S. Suon and A. G. Baro: Reinforcemen Learning: An Inroducion 1 N-sep TD Predicion Idea: Look farher ino he fuure when you do TD backup (1, 2, 3,, n seps) R. S. Suon and

More information

BU Macro BU Macro Fall 2008, Lecture 4

BU Macro BU Macro Fall 2008, Lecture 4 Dynamic Programming BU Macro 2008 Lecure 4 1 Ouline 1. Cerainy opimizaion problem used o illusrae: a. Resricions on exogenous variables b. Value funcion c. Policy funcion d. The Bellman equaion and an

More information

Tracking. Many slides adapted from Kristen Grauman, Deva Ramanan

Tracking. Many slides adapted from Kristen Grauman, Deva Ramanan Tracking Man slides adaped from Krisen Grauman Deva Ramanan Coures G. Hager Coures G. Hager J. Kosecka cs3b Adapive Human-Moion Tracking Acquisiion Decimaion b facor 5 Moion deecor Grascale convers. Image

More information

Look-ahead Proposals for Robust Grid-based SLAM

Look-ahead Proposals for Robust Grid-based SLAM Look-ahead Proposals for Robus Grid-based SLAM Slawomir Grzonka, Chrisian Plagemann, Giorgio Grisei, Wolfram Burgard To cie his version: Slawomir Grzonka, Chrisian Plagemann, Giorgio Grisei, Wolfram Burgard.

More information

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1 SZG Macro 2011 Lecure 3: Dynamic Programming SZG macro 2011 lecure 3 1 Background Our previous discussion of opimal consumpion over ime and of opimal capial accumulaion sugges sudying he general decision

More information

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing Applicaion of a Sochasic-Fuzzy Approach o Modeling Opimal Discree Time Dynamical Sysems by Using Large Scale Daa Processing AA WALASZE-BABISZEWSA Deparmen of Compuer Engineering Opole Universiy of Technology

More information

Optimal Path Planning for Flexible Redundant Robot Manipulators

Optimal Path Planning for Flexible Redundant Robot Manipulators 25 WSEAS In. Conf. on DYNAMICAL SYSEMS and CONROL, Venice, Ialy, November 2-4, 25 (pp363-368) Opimal Pah Planning for Flexible Redundan Robo Manipulaors H. HOMAEI, M. KESHMIRI Deparmen of Mechanical Engineering

More information

MATH 128A, SUMMER 2009, FINAL EXAM SOLUTION

MATH 128A, SUMMER 2009, FINAL EXAM SOLUTION MATH 28A, SUMME 2009, FINAL EXAM SOLUTION BENJAMIN JOHNSON () (8 poins) [Lagrange Inerpolaion] (a) (4 poins) Le f be a funcion defined a some real numbers x 0,..., x n. Give a defining equaion for he Lagrange

More information

A variational radial basis function approximation for diffusion processes.

A variational radial basis function approximation for diffusion processes. A variaional radial basis funcion approximaion for diffusion processes. Michail D. Vreas, Dan Cornford and Yuan Shen {vreasm, d.cornford, y.shen}@ason.ac.uk Ason Universiy, Birmingham, UK hp://www.ncrg.ason.ac.uk

More information

Rapid Termination Evaluation for Recursive Subdivision of Bezier Curves

Rapid Termination Evaluation for Recursive Subdivision of Bezier Curves Rapid Terminaion Evaluaion for Recursive Subdivision of Bezier Curves Thomas F. Hain School of Compuer and Informaion Sciences, Universiy of Souh Alabama, Mobile, AL, U.S.A. Absrac Bézier curve flaening

More information

Deep Learning: Theory, Techniques & Applications - Recurrent Neural Networks -

Deep Learning: Theory, Techniques & Applications - Recurrent Neural Networks - Deep Learning: Theory, Techniques & Applicaions - Recurren Neural Neworks - Prof. Maeo Maeucci maeo.maeucci@polimi.i Deparmen of Elecronics, Informaion and Bioengineering Arificial Inelligence and Roboics

More information

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should Cambridge Universiy Press 978--36-60033-7 Cambridge Inernaional AS and A Level Mahemaics: Mechanics Coursebook Excerp More Informaion Chaper The moion of projeciles In his chaper he model of free moion

More information

Single-Pass-Based Heuristic Algorithms for Group Flexible Flow-shop Scheduling Problems

Single-Pass-Based Heuristic Algorithms for Group Flexible Flow-shop Scheduling Problems Single-Pass-Based Heurisic Algorihms for Group Flexible Flow-shop Scheduling Problems PEI-YING HUANG, TZUNG-PEI HONG 2 and CHENG-YAN KAO, 3 Deparmen of Compuer Science and Informaion Engineering Naional

More information

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still. Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in

More information

Decimal moved after first digit = 4.6 x Decimal moves five places left SCIENTIFIC > POSITIONAL. a) g) 5.31 x b) 0.

Decimal moved after first digit = 4.6 x Decimal moves five places left SCIENTIFIC > POSITIONAL. a) g) 5.31 x b) 0. PHYSICS 20 UNIT 1 SCIENCE MATH WORKSHEET NAME: A. Sandard Noaion Very large and very small numbers are easily wrien using scienific (or sandard) noaion, raher han decimal (or posiional) noaion. Sandard

More information

Západočeská Univerzita v Plzni, Czech Republic and Groupe ESIEE Paris, France

Západočeská Univerzita v Plzni, Czech Republic and Groupe ESIEE Paris, France ADAPTIVE SIGNAL PROCESSING USING MAXIMUM ENTROPY ON THE MEAN METHOD AND MONTE CARLO ANALYSIS Pavla Holejšovsá, Ing. *), Z. Peroua, Ing. **), J.-F. Bercher, Prof. Assis. ***) Západočesá Univerzia v Plzni,

More information

Random Walk with Anti-Correlated Steps

Random Walk with Anti-Correlated Steps Random Walk wih Ani-Correlaed Seps John Noga Dirk Wagner 2 Absrac We conjecure he expeced value of random walks wih ani-correlaed seps o be exacly. We suppor his conjecure wih 2 plausibiliy argumens and

More information

Tracking. Many slides adapted from Kristen Grauman, Deva Ramanan

Tracking. Many slides adapted from Kristen Grauman, Deva Ramanan Tracking Man slides adaped from Krisen Grauman Deva Ramanan Coures G. Hager Coures G. Hager J. Kosecka cs3b Adapive Human-Moion Tracking Acquisiion Decimaion b facor 5 Moion deecor Grascale convers. Image

More information

m = 41 members n = 27 (nonfounders), f = 14 (founders) 8 markers from chromosome 19

m = 41 members n = 27 (nonfounders), f = 14 (founders) 8 markers from chromosome 19 Sequenial Imporance Sampling (SIS) AKA Paricle Filering, Sequenial Impuaion (Kong, Liu, Wong, 994) For many problems, sampling direcly from he arge disribuion is difficul or impossible. One reason possible

More information

Simultaneous Localization and Mapping with Unknown Data Association Using FastSLAM

Simultaneous Localization and Mapping with Unknown Data Association Using FastSLAM Simulaneous Localizaion and Mapping wih Unknown Daa Associaion Using FasSLAM Michael Monemerlo, Sebasian Thrun Absrac The Exended Kalman Filer (EKF has been he de faco approach o he Simulaneous Localizaion

More information

Motion Planning under Uncertainty using Iterative Local Optimization in Belief Space

Motion Planning under Uncertainty using Iterative Local Optimization in Belief Space Moion Planning under Uncerainy using Ieraive Local Opimizaion in Belief Space Jur van den Berg 1 Sachin Pail 2 Ron Aleroviz 2 1 School of Compuing, Universiy of Uah, berg@cs.uah.edu. 2 Dep. of Compuer

More information

Mechanical Fatigue and Load-Induced Aging of Loudspeaker Suspension. Wolfgang Klippel,

Mechanical Fatigue and Load-Induced Aging of Loudspeaker Suspension. Wolfgang Klippel, Mechanical Faigue and Load-Induced Aging of Loudspeaker Suspension Wolfgang Klippel, Insiue of Acousics and Speech Communicaion Dresden Universiy of Technology presened a he ALMA Symposium 2012, Las Vegas

More information

FastSLAM with Stereo Vision

FastSLAM with Stereo Vision FasSLAM wih Sereo Vision Wikus Brink Elecronic Sysems Lab Elecrical and Elecronic Engineering Sellenbosch Universiy Email: wikusbrink@ieee.org Corné E. van Daalen Elecronic Sysems Lab Elecrical and Elecronic

More information

Dimitri Solomatine. D.P. Solomatine. Data-driven modelling (part 2). 2

Dimitri Solomatine. D.P. Solomatine. Data-driven modelling (part 2). 2 Daa-driven modelling. Par. Daa-driven Arificial di Neural modelling. Newors Par Dimiri Solomaine Arificial neural newors D.P. Solomaine. Daa-driven modelling par. 1 Arificial neural newors ANN: main pes

More information

Robotics I. April 11, The kinematics of a 3R spatial robot is specified by the Denavit-Hartenberg parameters in Tab. 1.

Robotics I. April 11, The kinematics of a 3R spatial robot is specified by the Denavit-Hartenberg parameters in Tab. 1. Roboics I April 11, 017 Exercise 1 he kinemaics of a 3R spaial robo is specified by he Denavi-Harenberg parameers in ab 1 i α i d i a i θ i 1 π/ L 1 0 1 0 0 L 3 0 0 L 3 3 able 1: able of DH parameers of

More information

Hidden Markov Models

Hidden Markov Models Hidden Markov Models Probabilisic reasoning over ime So far, we ve mosly deal wih episodic environmens Excepions: games wih muliple moves, planning In paricular, he Bayesian neworks we ve seen so far describe

More information

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models. Technical Repor Doc ID: TR--203 06-March-203 (Las revision: 23-Februar-206) On formulaing quadraic funcions in opimizaion models. Auhor: Erling D. Andersen Convex quadraic consrains quie frequenl appear

More information

Lecture 20: Riccati Equations and Least Squares Feedback Control

Lecture 20: Riccati Equations and Least Squares Feedback Control 34-5 LINEAR SYSTEMS Lecure : Riccai Equaions and Leas Squares Feedback Conrol 5.6.4 Sae Feedback via Riccai Equaions A recursive approach in generaing he marix-valued funcion W ( ) equaion for i for he

More information

Efficient POMDP Forward Search by Predicting the Posterior Belief Distribution Ruijie He and Nicholas Roy

Efficient POMDP Forward Search by Predicting the Posterior Belief Distribution Ruijie He and Nicholas Roy Compuer Science and Arificial Inelligence Laboraory Technical Repor MIT-CSAIL-TR-2009-044 Sepember 23, 2009 Efficien POMDP Forward Search by Predicing he Poserior Belief Disribuion Ruijie He and Nicholas

More information

Lecture 2 October ε-approximation of 2-player zero-sum games

Lecture 2 October ε-approximation of 2-player zero-sum games Opimizaion II Winer 009/10 Lecurer: Khaled Elbassioni Lecure Ocober 19 1 ε-approximaion of -player zero-sum games In his lecure we give a randomized ficiious play algorihm for obaining an approximae soluion

More information

Simulating models with heterogeneous agents

Simulating models with heterogeneous agents Simulaing models wih heerogeneous agens Wouer J. Den Haan London School of Economics c by Wouer J. Den Haan Individual agen Subjec o employmen shocks (ε i, {0, 1}) Incomplee markes only way o save is hrough

More information

On-line Adaptive Optimal Timing Control of Switched Systems

On-line Adaptive Optimal Timing Control of Switched Systems On-line Adapive Opimal Timing Conrol of Swiched Sysems X.C. Ding, Y. Wardi and M. Egersed Absrac In his paper we consider he problem of opimizing over he swiching imes for a muli-modal dynamic sysem when

More information

Presentation Overview

Presentation Overview Acion Refinemen in Reinforcemen Learning by Probabiliy Smoohing By Thomas G. Dieerich & Didac Busques Speaer: Kai Xu Presenaion Overview Bacground The Probabiliy Smoohing Mehod Experimenal Sudy of Acion

More information

) were both constant and we brought them from under the integral.

) were both constant and we brought them from under the integral. YIELD-PER-RECRUIT (coninued The yield-per-recrui model applies o a cohor, bu we saw in he Age Disribuions lecure ha he properies of a cohor do no apply in general o a collecion of cohors, which is wha

More information

Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé

Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé Bias in Condiional and Uncondiional Fixed Effecs Logi Esimaion: a Correcion * Tom Coupé Economics Educaion and Research Consorium, Naional Universiy of Kyiv Mohyla Academy Address: Vul Voloska 10, 04070

More information

Data Fusion using Kalman Filter. Ioannis Rekleitis

Data Fusion using Kalman Filter. Ioannis Rekleitis Daa Fusion using Kalman Filer Ioannis Rekleiis Eample of a arameerized Baesian Filer: Kalman Filer Kalman filers (KF represen poserior belief b a Gaussian (normal disribuion A -d Gaussian disribuion is

More information

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD HAN XIAO 1. Penalized Leas Squares Lasso solves he following opimizaion problem, ˆβ lasso = arg max β R p+1 1 N y i β 0 N x ij β j β j (1.1) for some 0.

More information

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence Supplemen for Sochasic Convex Opimizaion: Faser Local Growh Implies Faser Global Convergence Yi Xu Qihang Lin ianbao Yang Proof of heorem heorem Suppose Assumpion holds and F (w) obeys he LGC (6) Given

More information

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance

More information

!!"#"$%&#'()!"#&'(*%)+,&',-)./0)1-*23)

!!#$%&#'()!#&'(*%)+,&',-)./0)1-*23) "#"$%&#'()"#&'(*%)+,&',-)./)1-*) #$%&'()*+,&',-.%,/)*+,-&1*#$)()5*6$+$%*,7&*-'-&1*(,-&*6&,7.$%$+*&%'(*8$&',-,%'-&1*(,-&*6&,79*(&,%: ;..,*&1$&$.$%&'()*1$$.,'&',-9*(&,%)?%*,('&5

More information

Approximation Algorithms for Unique Games via Orthogonal Separators

Approximation Algorithms for Unique Games via Orthogonal Separators Approximaion Algorihms for Unique Games via Orhogonal Separaors Lecure noes by Konsanin Makarychev. Lecure noes are based on he papers [CMM06a, CMM06b, LM4]. Unique Games In hese lecure noes, we define

More information

Final Spring 2007

Final Spring 2007 .615 Final Spring 7 Overview The purpose of he final exam is o calculae he MHD β limi in a high-bea oroidal okamak agains he dangerous n = 1 exernal ballooning-kink mode. Effecively, his corresponds o

More information

References are appeared in the last slide. Last update: (1393/08/19)

References are appeared in the last slide. Last update: (1393/08/19) SYSEM IDEIFICAIO Ali Karimpour Associae Professor Ferdowsi Universi of Mashhad References are appeared in he las slide. Las updae: 0..204 393/08/9 Lecure 5 lecure 5 Parameer Esimaion Mehods opics o be

More information

Fundamental Problems In Robotics

Fundamental Problems In Robotics Fundamenal Problems In Roboics Wha does he world looks like? (mapping sense from various posiions inegrae measuremens o produce map assumes perfec knowledge of posiion Where am I in he world? (localizaion

More information

Energy Storage Benchmark Problems

Energy Storage Benchmark Problems Energy Sorage Benchmark Problems Daniel F. Salas 1,3, Warren B. Powell 2,3 1 Deparmen of Chemical & Biological Engineering 2 Deparmen of Operaions Research & Financial Engineering 3 Princeon Laboraory

More information

Some Basic Information about M-S-D Systems

Some Basic Information about M-S-D Systems Some Basic Informaion abou M-S-D Sysems 1 Inroducion We wan o give some summary of he facs concerning unforced (homogeneous) and forced (non-homogeneous) models for linear oscillaors governed by second-order,

More information

Ensemble Confidence Estimates Posterior Probability

Ensemble Confidence Estimates Posterior Probability Ensemble Esimaes Poserior Probabiliy Michael Muhlbaier, Aposolos Topalis, and Robi Polikar Rowan Universiy, Elecrical and Compuer Engineering, Mullica Hill Rd., Glassboro, NJ 88, USA {muhlba6, opali5}@sudens.rowan.edu

More information

Object tracking: Using HMMs to estimate the geographical location of fish

Object tracking: Using HMMs to estimate the geographical location of fish Objec racking: Using HMMs o esimae he geographical locaion of fish 02433 - Hidden Markov Models Marin Wæver Pedersen, Henrik Madsen Course week 13 MWP, compiled June 8, 2011 Objecive: Locae fish from agging

More information

Non-parametric techniques. Instance Based Learning. NN Decision Boundaries. Nearest Neighbor Algorithm. Distance metric important

Non-parametric techniques. Instance Based Learning. NN Decision Boundaries. Nearest Neighbor Algorithm. Distance metric important on-parameric echniques Insance Based Learning AKA: neares neighbor mehods, non-parameric, lazy, memorybased, or case-based learning Copyrigh 2005 by David Helmbold 1 Do no fi a model (as do LDA, logisic

More information

Timed Circuits. Asynchronous Circuit Design. Timing Relationships. A Simple Example. Timed States. Timing Sequences. ({r 6 },t6 = 1.

Timed Circuits. Asynchronous Circuit Design. Timing Relationships. A Simple Example. Timed States. Timing Sequences. ({r 6 },t6 = 1. Timed Circuis Asynchronous Circui Design Chris J. Myers Lecure 7: Timed Circuis Chaper 7 Previous mehods only use limied knowledge of delays. Very robus sysems, bu exremely conservaive. Large funcional

More information

ACE 562 Fall Lecture 8: The Simple Linear Regression Model: R 2, Reporting the Results and Prediction. by Professor Scott H.

ACE 562 Fall Lecture 8: The Simple Linear Regression Model: R 2, Reporting the Results and Prediction. by Professor Scott H. ACE 56 Fall 5 Lecure 8: The Simple Linear Regression Model: R, Reporing he Resuls and Predicion by Professor Sco H. Irwin Required Readings: Griffihs, Hill and Judge. "Explaining Variaion in he Dependen

More information

A First Course on Kinetics and Reaction Engineering. Class 19 on Unit 18

A First Course on Kinetics and Reaction Engineering. Class 19 on Unit 18 A Firs ourse on Kineics and Reacion Engineering lass 19 on Uni 18 Par I - hemical Reacions Par II - hemical Reacion Kineics Where We re Going Par III - hemical Reacion Engineering A. Ideal Reacors B. Perfecly

More information

Macroeconomic Theory Ph.D. Qualifying Examination Fall 2005 ANSWER EACH PART IN A SEPARATE BLUE BOOK. PART ONE: ANSWER IN BOOK 1 WEIGHT 1/3

Macroeconomic Theory Ph.D. Qualifying Examination Fall 2005 ANSWER EACH PART IN A SEPARATE BLUE BOOK. PART ONE: ANSWER IN BOOK 1 WEIGHT 1/3 Macroeconomic Theory Ph.D. Qualifying Examinaion Fall 2005 Comprehensive Examinaion UCLA Dep. of Economics You have 4 hours o complee he exam. There are hree pars o he exam. Answer all pars. Each par has

More information

Testing for a Single Factor Model in the Multivariate State Space Framework

Testing for a Single Factor Model in the Multivariate State Space Framework esing for a Single Facor Model in he Mulivariae Sae Space Framework Chen C.-Y. M. Chiba and M. Kobayashi Inernaional Graduae School of Social Sciences Yokohama Naional Universiy Japan Faculy of Economics

More information

CSE-473. A Gentle Introduction to Particle Filters

CSE-473. A Gentle Introduction to Particle Filters CSE-473 A Genle Inroducion o Paricle Filers Bayes Filers for Robo Localizaion Dieer Fo 2 Bayes Filers: Framework Given: Sream of observaions z and acion daa u: d Sensor model Pz. = { u, z2, u 1, z 1 Dynamics

More information