Anytime Planning for Decentralized Multi-Robot Active Information Gathering

Size: px
Start display at page:

Download "Anytime Planning for Decentralized Multi-Robot Active Information Gathering"

Transcription

1 Anytime Planning for Decentralized Multi-Robot Active Information Gathering Brent Schlotfeldt 1 Dinesh Thakur 1 Nikolay Atanasov 2 Vijay Kumar 1 George Pappas 1 1 GRASP Laboratory University of Pennsylvania Philadelphia, PA, USA 2 Department of Electrical and Computer Engineering University of California San Diego San Diego, CA, USA May 17, 2018 Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

2 What is the problem? Suppose the robot at x 0 wants to plan a path to x T to track a target y T Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

3 Overview 1 Active Information Gathering Problem Statement Reduction to Open-Loop Control, and Optimal Solution (ɛ,δ)-reduced Value Iteration Anytime Reduced Value Iteration 2 Multiple Robots Coordinate Descent Distributed Estimation Application: Target Tracking Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

4 Problem Statement Consider a team of n mobile robots, with sensor motion models: x i,t+1 = f i (x i,t, u i,t ), i {1,..., n} (1) where x i,t X, u i,t U, and U is a finite set. Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

5 Problem Statement Consider a team of n mobile robots, with sensor motion models: x i,t+1 = f i (x i,t, u i,t ), i {1,..., n} (1) where x i,t X, u i,t U, and U is a finite set. The robots goal is to track a target y t with target motion model: y t+1 = Ay t + w t, w t N (0, W ) t = 0,..., T (2) Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

6 Problem Statement Consider a team of n mobile robots, with sensor motion models: x i,t+1 = f i (x i,t, u i,t ), i {1,..., n} (1) where x i,t X, u i,t U, and U is a finite set. The robots goal is to track a target y t with target motion model: y t+1 = Ay t + w t, w t N (0, W ) t = 0,..., T (2) The robots are able to sense the target using on-board sensors obeying the following sensor observation model: z i,t = H(x i,t )y t + v t (x i,t ), v t N (0, V (x i,t )) (3) Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

7 Problem Statement Consider a team of n mobile robots, with sensor motion models: x i,t+1 = f i (x i,t, u i,t ), i {1,..., n} (1) where x i,t X, u i,t U, and U is a finite set. The robots goal is to track a target y t with target motion model: y t+1 = Ay t + w t, w t N (0, W ) t = 0,..., T (2) The robots are able to sense the target using on-board sensors obeying the following sensor observation model: z i,t = H(x i,t )y t + v t (x i,t ), v t N (0, V (x i,t )) (3) Note the Linear, Gaussian assumptions on the target and observations, but not on the robot. The set U of admissible controls is finite. To simplify notation, we refer to the state of the team as x t at time t. Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

8 Active Information Gathering Problem (Active Information Acquisition) Given an initial sensor pose x 0 X, a prior distribution over the target state y 0, and a finite planning horizon T, the task is to choose a sequence of control actions: u 0,..., u T 1 U T which minimize conditional entropy h between the target state: y 1:T and the measurement set z 1:T : min u U T J (n) T T (u) := h(y t z 1:t, x 1:t ) (4) t=1 s.t. x t+1 = f (x t, u t ), t = 0,..., T 1 y t+1 = Ay t + w t t = 0,..., T 1 z t = H(x t )y t + v t t = 0,..., T Note that many other information measures are possible, such as mutual information, minimum mean-squared error, and others. 1 1 Nikolay Atanasov et al. Information acquisition with sensing robots: Algorithms and error bounds. In: 2014 IEEE International Conference on Robotics and Automation (ICRA) Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25 (5)

9 Reduction to Open-Loop Control Prior y 0 is Gaussian N (y 0, Σ 0 ), = Kalman filter is optimal. Objective proportional to log det, i.e. h(y t z 1:t, x 1:t ) log det(σ t ) Σ t independent of realizations z 1:t = open loop control optimal! 2 Problem (Active Information Acquisition) min µ U T J (n) T T (µ) := log det(σ t ) t=1 s.t. x t+1 = f (x t, u t ), t = 0,..., T 1 Σ t+1 = ρ p t+1 (ρe t (Σ t ), x t+1 ), t = 0,..., T 1 Update: ρ e t (Σ, x) := Σ K t (Σ, x)h t (x)σ K t (Σ, x) := ΣH t (x) T (H t (x)σh t (x) T + V t (x)) 1 Predict: ρ p t (Σ) := A t ΣA T t + W t 2 Jerome Le Ny and George J Pappas. On trajectory optimization for active sensing in Gaussian process models. In: 2009 IEEE Conference on Decision and Control (CDC) Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

10 Optimal Solution Deterministic optimal control problem can be solved with Forward Value Iteration, which builds a search tree from (x 0, Σ 0 ) x, 0 0 u 1 u 2 u 3 Depth due to length of planning horizon Full search has complexity O( U T ). Can we do better? Branching due to number of motion primitives Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

11 What if trajectories cross? Want: Conditions on Σ t to eliminate nodes if trajectories cross at level t of the tree: x x x1 2 3 x 1 3 a (1 a) x 2 2 x 3 2 x 2 4 x 2 5 x 2 There is a convex combination of Σ 1, and Σ 3 that dominates Σ 2, i.e. Σ 2 aσ 1 + (1 a)σ 3. It can be safely removed from the tree. Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

12 ɛ - Algebraic Redundancy Want: Conditions on Σ t to eliminate nodes if trajectories cross: 3 Definition (ɛ-algebraic Redundancy ) Let ɛ 0 and let {Σ i } K i=1 S + n be a finite set. A matrix Σ S+ n is ɛ-algebraically redundant with respect to {Σ i } if there exist nonnegative constants {α i } K i=1 such that: K K α i = 1 and Σ + ɛi n α i Σ i i=1 i=1 a (1 a) 1 3 Example: Suppose aσ 1 + (1 a)σ 3 is a convex combination of Σ 1, and Σ 2. Σ 1 and Σ 3 cannot strictly eliminate Σ 2, because it is not contained in aσ 1 + (1 a)σ 3. However if we relax the inequality by ɛi, we can remove Σ 2. 3 Michael P Vitus et al. On efficient sensor scheduling for linear dynamical systems. In: Automatica (2012), pp Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

13 (ɛ, δ)- Reductions Can we remove even more trajectories? What if the trajectories do not exactly cross at time t? Definition (Trajectory δ-crossing) Trajectories π 1, π 2 X T δ-cross at time t [1, T ] if d X (π 1 t, π 2 t ) δ for δ 0. Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

14 (ɛ, δ) Reduced Value Iteration Idea: Prune trajectories close in space and similar in informativeness Expanding Nodes in Search Tree = (ɛ,δ)-pruning = Algorithm 1 (ɛ, δ) Reduced Value Iteration 1: J 0 0, S 0 {(x 0, Σ 0, J 0)}. S t for t = 1,...T 2: for t = 1 : T do 3: for all (x, Σ, J) S t 1 do 4: for all u U do 5: x t f (x, u), Σ t ρ e t+1(ρ p t (Σ, x, u), x t)) 6: J t J + log det(σ t) 7: S t S t {(x t, Σ t, J t)} 8: end for 9: end for 10: Sort S t in ascending order according to log det( ) 11: S t S t[1] 12: for all (x, Σ, J) S t \ S t[1] do 13: % Find all nodes in S t, which δ-cross x: 14: Q {(Σ ) (x, Σ, J )} S t, d X (x, x ) δ 15: if isempty(q) or not(σ is ɛ-alg-red wrt Q) then 16: S t S t (x, Σ, J) 17: end if 18: end for 19: S t S t 20: end for 21: return min {J (x, Σ) S T } Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

15 Performance Guarantee Search tree with (ɛ, δ)-pruning is (ɛ, δ)-suboptimal with cost J ɛ,δ T that the bound holds 4 such 0 J ɛ,δ T J T (ζ T 1) [ JT log det(w )] + ɛ T (6) t 1 τ where ζ T := (1 + L s f L mδ) 1 (7) and T, L s f and L m are constants resulting from continuity assumptions. τ=1 Note the tradeoff in performance: (ɛ,δ) (0, 0) = J ɛ,δ T J T. If (ɛ,δ) (, ), we get a greedy policy with no guarantee. This bound is monotone in ɛ, δ. s=1 4 Nikolay Atanasov et al. Information acquisition with sensing robots: Algorithms and error bounds. In: 2014 IEEE International Conference on Robotics and Automation (ICRA) Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

16 Motivation for Anytime Planning How do ɛ,δ affect execution time? What if problem dimension changes? (e.g. more targets) What if team changes? (e.g. more robots) All of these can be addressed by modifying the search algorithm to iteratively construct the tree, adding progressively more trajectories until execution time runs out, always returning a feasible solution. This is called an anytime algorithm. Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

17 Anytime Reduced Value Iteration Idea: First build a greedy search tree, then iteratively ImprovePath until time expires To efficiently re-use computations, we mark all computed nodes S t as open and closed, i.e. O t, C t at all levels t in the tree 5 Algorithm 2 Anytime Reduced Value Iteration (x 0, Σ 0, T ARVI ) 1: J 0 0, S 0 {(x 0, Σ 0, J 0)}. S t for t = 1,...T 2: C 0 3: O t S t for t = 0,...T 4: S {S 0,..S T }, O {O 0,...O T }, C {C 0,..C T } 5: ɛ, δ 6: {S, O, C} ImprovePath(S, O, C, ɛ, δ) 7: Publish best solution from O[T ] 8: while Time Elapsed T ARVI do 9: Decrease (ɛ, δ) 10: {S, O, C} ImprovePath(S, O, C, ɛ, δ) 11: Publish best solution J from O[T ] 12: end while 5 Brent Schlotfeldt et al. Anytime Planning for Decentralized Multirobot Active Information Gathering. In: IEEE Robotics and Automation Letters 3.2 (2018), pp Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

18 Tree Interpretation Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

19 Improve Path Expanding Open, but not Closed nodes = (ɛ,δ)-pruning by marking nodes Open = Algorithm 3 ImprovePath (S, O, C, ɛ, δ) 1: for t = 1 : T do 2: for all (x, Σ, J) O t 1 \ C t 1 do 3: C t 1 C t 1 {x, Σ} 4: for all u U do 5: x t f (x, u), Σ t ρ xt (Σ) 6: J t J + log det(σ t ) 7: S t S t {(x t, Σ, J t ))} 8: end for 9: end for 10: Sort S t in ascending order according to log det( ) 11: O t O t S t [1] 12: for all (x, Σ, J) S t \ O t do 13: % Find all nodes in O t, which δ-cross x: 14: Q {Σ (x, Σ, J )} O t, d X (x, x ) δ 15: O t O t (x, Σ, J) 16: end for 17: O[t] = O t 18: end for 19: return {S, O, C} Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

20 Main Result The following holds regarding the performance of ARVI: 6 Theorem (ARVI) The following are satisfied by the ARVI algorithm: (i) When ImprovePath(ɛ, δ) returns with finite (ɛ, δ), the returned solution is guaranteed to be (ɛ, δ)-sub-optimal. (ii) The cost J (n) T time. of the returned solution decreases monotonically over (iii) Given infinite time, C = O S, and O[T ] will contain the optimal solution to the planning problem. 6 Brent Schlotfeldt et al. Anytime Planning for Decentralized Multirobot Active Information Gathering. In: IEEE Robotics and Automation Letters 3.2 (2018), pp Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

21 What about multiple robots? Algorithm still scales exponentially in the number of robots. Consider a coordinate descent approach 7. Each robot solves their own planning problem with fixed trajectories from prior robots. This continues: u1,0:(t c 1) argmin J (1) T (ˆµ) (8) ˆµ U1 T u2,0:(t c 1) argmin J (2) T (uc 1,0:(T 1), ˆµ) (9) ˆµ U2 T. 7 Nikolay Atanasov et al. Decentralized active information acquisition: Theory and application to multi-robot SLAM.. In: 2015 IEEE International Conference on Robotics and Automation (ICRA) (2015), pp Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

22 Performance Coordinate Descent reduces the planning complexity from exponential to linear in number of robots. Algorithm achieves no less than 50% of the optimal performance, in the case of submodular objective functions 8. Results can also be extended to (approximately) sub-modular objective functions. 8 Nikolay Atanasov et al. Decentralized active information acquisition: Theory and application to multi-robot SLAM.. In: 2015 IEEE International Conference on Robotics and Automation (ICRA) (2015), pp Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

23 What about Estimation? The robots can use a distributed information filter 9, which uses the inverse covariance Ω i,t = Σ 1 i,t and ω i,t = Σ 1 i,t y i,t: Update Step: ω i,t+1 = Predict Step: Ω i,t+1 = j N i {i} j N i {i} ŷ i (t) := Ω 1 i,t ω i,t κ i,j ω j,t + H T i κ i,j Ω j,t + H T i Ω i,t+1 = (AΩ 1 i,t+1 AT + W ) 1 ω i,t+1 = Ω i,t+1 Aŷ i,t+1 V 1 i z i (t) V 1 i H i (10) 9 Nikolay Atanasov et al. Joint estimation and localization in sensor networks. In: Decision and Control (CDC), 2014 IEEE 53rd Annual Conference on. IEEE. 2014, pp Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

24 Application: Target Tracking We use a discretized unicycle model for f (x, u): x t+1 1 x xt+1 2 t 1 ν sinc( ωτ = xt ) cos(θ t + ωτ ν sinc( ωτ 2 ) sin(θ t + ωτ θ t+1 θ t τω 2 ) 2 ) (11) Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

25 Application: Target Tracking We use a discretized unicycle model for f (x, u): x t+1 1 x xt+1 2 t 1 ν sinc( ωτ = xt ) cos(θ t + ωτ ν sinc( ωτ 2 ) sin(θ t + ωτ θ t+1 θ t τω 2 ) 2 ) (11) The target model is a double integrator: [ ] I2 τi y t+1 = A 2 y 0 I t + w t, w t N 2 ( [ ]) τ 0, q 3 /3I 2 τ 2 /2I 2 τ 2 /2I 2 τi 2 (12) Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

26 Application: Target Tracking We use a discretized unicycle model for f (x, u): x t+1 1 x xt+1 2 t 1 ν sinc( ωτ = xt ) cos(θ t + ωτ ν sinc( ωτ 2 ) sin(θ t + ωτ θ t+1 θ t τω 2 ) 2 ) (11) The target model is a double integrator: [ ] I2 τi y t+1 = A 2 y 0 I t + w t, w t N 2 The robots have range and bearing sensors. ( [ ]) τ 0, q 3 /3I 2 τ 2 /2I 2 τ 2 /2I 2 τi 2 z t,m = h(x t, y t,m ) + v t, v t N ( 0, V (x t, y t,m ) ) [ ] [ ] r(x, y) (y 1 x h(x, y) = := 1 ) 2 + (y 2 x 2 ) 2 α(x, y) tan 1 ((y 2 x 2 )(y 1 x 1 )) θ (12) (13) Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

27 Linearization, and MPC Due to the non-linear observation model, we linearize about a predicted target trajectory y and apply MPC: y h(x, y) = 1 [ ] (y 1 x 1 ) (y 2 x 2 ) 0 1x2 (14) r(x, y) sin(θ + α(x, y)) cos(θ + α(x, y)) 0 1x2 Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

28 Linearization, and MPC Due to the non-linear observation model, we linearize about a predicted target trajectory y and apply MPC: y h(x, y) = 1 [ ] (y 1 x 1 ) (y 2 x 2 ) 0 1x2 (14) r(x, y) sin(θ + α(x, y)) cos(θ + α(x, y)) 0 1x2 Algorithm 5 Anytime Multi-Robot Target Tracking 1: Input: T max, x 0, ˆω 0, ˆΩ 0, f, U, H, V, A, W, T, T ARVI 2: while t = 1 : T max do 3: Send ω i,t and Ω i,t to neighboring robots. 4: Receive measurements z i,t and perform distributed update step with any neighbor ω i,t, Ω i,t received. 5: Predict a target trajectory of length T : ŷ t,...ŷ T, and linearize observation model: H t y h(, ŷ t ) 6: Plan T -step trajectories with ARVI (Alg. 2) and coordinate descent. 7: Apply first control input to move each robot. 8: end while Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

29 Simulation Environment Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

30 Some Results 2 Average Target Entropy 10 Number of Targets 1.2 Average RMSE over time 0 m Entropy per target (nats) m 1 m 5 m 10 m 15 m m Average targets detected m 1 m 5 m 10 m 15 m m RMSE per target (m) m 5 m 10 m 15 m m Timestep (a) Timestep (b) Timestep (c) 4 Entropy per Target 25 Number of Targets Tracked 2 Position RMSE per target Entropy per target (nats) T ARVI = 0 (greedy) T ARVI = 1.5s T ARVI = 3s Number of targets T ARVI = 0 (greedy) T ARVI = 1.5s T ARVI = 3s RMSE (m) T ARVI = 0 (greedy) T ARVI = 1.5s T ARVI = 3s Timestep (d) Timestep (e) Timestep (f) Figure: Simulation results with 9 robots tracking 16 moving targets. The top row sweeps over communication radius, and the bottom row sweeps planning time. Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

31 Conclusion Future Work: Resilient Active Information Acquisition (IROS 2018) Unknown Target Models Quadrotor Motion Model, Camera observation Model, Continuous Actions, etc. Brent Schlotfeldt 1, Dinesh Thakur 1, Nikolay Atanasov 2, Anytime Vijay Kumar Planning 1,, George Pappas 1 (Penn) May 17, / 25

Non-Myopic Target Tracking Strategies for Non-Linear Systems

Non-Myopic Target Tracking Strategies for Non-Linear Systems Non-Myopic Target Tracking Strategies for Non-Linear Systems Zhongshun Zhang and Pratap Tokekar Abstract We study the problem of devising a closed-loop strategy to control the position of a robot that

More information

Decentralized Active Information Acquisition: Theory and Application to Multi-Robot SLAM

Decentralized Active Information Acquisition: Theory and Application to Multi-Robot SLAM 25 IEEE International Conference on Robotics and Automation ICRA Washington State Convention Center Seattle, Washington, May 26-3, 25 Decentralized Active Information Acquisition: heory and Application

More information

10 Robotic Exploration and Information Gathering

10 Robotic Exploration and Information Gathering NAVARCH/EECS 568, ROB 530 - Winter 2018 10 Robotic Exploration and Information Gathering Maani Ghaffari April 2, 2018 Robotic Information Gathering: Exploration and Monitoring In information gathering

More information

Multi-Robotic Systems

Multi-Robotic Systems CHAPTER 9 Multi-Robotic Systems The topic of multi-robotic systems is quite popular now. It is believed that such systems can have the following benefits: Improved performance ( winning by numbers ) Distributed

More information

Partially Observable Markov Decision Processes (POMDPs)

Partially Observable Markov Decision Processes (POMDPs) Partially Observable Markov Decision Processes (POMDPs) Sachin Patil Guest Lecture: CS287 Advanced Robotics Slides adapted from Pieter Abbeel, Alex Lee Outline Introduction to POMDPs Locally Optimal Solutions

More information

Planning Periodic Persistent Monitoring Trajectories for Sensing Robots in Gaussian Random Fields

Planning Periodic Persistent Monitoring Trajectories for Sensing Robots in Gaussian Random Fields 013 IEEE International Conference on Robotics and Automation (ICRA) Karlsruhe, Germany, May 6-10, 013 Planning Periodic Persistent Monitoring Trajectories for Sensing Robots in Gaussian Random Fields Xiaodong

More information

OPTIMAL FUSION OF SENSOR DATA FOR DISCRETE KALMAN FILTERING Z. G. FENG, K. L. TEO, N. U. AHMED, Y. ZHAO, AND W. Y. YAN

OPTIMAL FUSION OF SENSOR DATA FOR DISCRETE KALMAN FILTERING Z. G. FENG, K. L. TEO, N. U. AHMED, Y. ZHAO, AND W. Y. YAN Dynamic Systems and Applications 16 (2007) 393-406 OPTIMAL FUSION OF SENSOR DATA FOR DISCRETE KALMAN FILTERING Z. G. FENG, K. L. TEO, N. U. AHMED, Y. ZHAO, AND W. Y. YAN College of Mathematics and Computer

More information

Luiz F. O. Chamon, George J. Pappas and Alejandro Ribeiro. CDC 2017 December 12 th, 2017

Luiz F. O. Chamon, George J. Pappas and Alejandro Ribeiro. CDC 2017 December 12 th, 2017 The Mean Square Error in Kalman Filtering Sensor Selection is Approximately Supermodular Luiz F. O. Chamon, George J. Pappas and Alejandro Ribeiro CDC 2017 December 12 th, 2017 Chamon, Pappas, Ribeiro

More information

Decentralized Stabilization of Heterogeneous Linear Multi-Agent Systems

Decentralized Stabilization of Heterogeneous Linear Multi-Agent Systems 1 Decentralized Stabilization of Heterogeneous Linear Multi-Agent Systems Mauro Franceschelli, Andrea Gasparri, Alessandro Giua, and Giovanni Ulivi Abstract In this paper the formation stabilization problem

More information

Gradient Sampling for Improved Action Selection and Path Synthesis

Gradient Sampling for Improved Action Selection and Path Synthesis Gradient Sampling for Improved Action Selection and Path Synthesis Ian M. Mitchell Department of Computer Science The University of British Columbia November 2016 mitchell@cs.ubc.ca http://www.cs.ubc.ca/~mitchell

More information

Distributed Data Fusion with Kalman Filters. Simon Julier Computer Science Department University College London

Distributed Data Fusion with Kalman Filters. Simon Julier Computer Science Department University College London Distributed Data Fusion with Kalman Filters Simon Julier Computer Science Department University College London S.Julier@cs.ucl.ac.uk Structure of Talk Motivation Kalman Filters Double Counting Optimal

More information

Robust Monte Carlo Methods for Sequential Planning and Decision Making

Robust Monte Carlo Methods for Sequential Planning and Decision Making Robust Monte Carlo Methods for Sequential Planning and Decision Making Sue Zheng, Jason Pacheco, & John Fisher Sensing, Learning, & Inference Group Computer Science & Artificial Intelligence Laboratory

More information

Explorability of a Turbulent Scalar Field

Explorability of a Turbulent Scalar Field 017 American Control Conference Sheraton Seattle Hotel May 4 6, 017, Seattle, USA Explorability of a Turbulent Scalar Field Vivek Mishra and Fumin Zhang Abstract In this paper we extend the concept of

More information

Nonlinear Optimization for Optimal Control Part 2. Pieter Abbeel UC Berkeley EECS. From linear to nonlinear Model-predictive control (MPC) POMDPs

Nonlinear Optimization for Optimal Control Part 2. Pieter Abbeel UC Berkeley EECS. From linear to nonlinear Model-predictive control (MPC) POMDPs Nonlinear Optimization for Optimal Control Part 2 Pieter Abbeel UC Berkeley EECS Outline From linear to nonlinear Model-predictive control (MPC) POMDPs Page 1! From Linear to Nonlinear We know how to solve

More information

A Multi-Robot Control Policy for Information Gathering in the Presence of Unknown Hazards

A Multi-Robot Control Policy for Information Gathering in the Presence of Unknown Hazards A Multi-Robot Control Policy for Information Gathering in the Presence of Unknown Hazards Mac Schwager, Philip Dames, Daniela Rus, and Vijay Kumar Abstract This paper addresses the problem of deploying

More information

= m(0) + 4e 2 ( 3e 2 ) 2e 2, 1 (2k + k 2 ) dt. m(0) = u + R 1 B T P x 2 R dt. u + R 1 B T P y 2 R dt +

= m(0) + 4e 2 ( 3e 2 ) 2e 2, 1 (2k + k 2 ) dt. m(0) = u + R 1 B T P x 2 R dt. u + R 1 B T P y 2 R dt + ECE 553, Spring 8 Posted: May nd, 8 Problem Set #7 Solution Solutions: 1. The optimal controller is still the one given in the solution to the Problem 6 in Homework #5: u (x, t) = p(t)x k(t), t. The minimum

More information

Coarticulation in Markov Decision Processes

Coarticulation in Markov Decision Processes Coarticulation in Markov Decision Processes Khashayar Rohanimanesh Department of Computer Science University of Massachusetts Amherst, MA 01003 khash@cs.umass.edu Sridhar Mahadevan Department of Computer

More information

Formation Control of Nonholonomic Mobile Robots

Formation Control of Nonholonomic Mobile Robots Proceedings of the 6 American Control Conference Minneapolis, Minnesota, USA, June -6, 6 FrC Formation Control of Nonholonomic Mobile Robots WJ Dong, Yi Guo, and JA Farrell Abstract In this paper, formation

More information

Manipulators. Robotics. Outline. Non-holonomic robots. Sensors. Mobile Robots

Manipulators. Robotics. Outline. Non-holonomic robots. Sensors. Mobile Robots Manipulators P obotics Configuration of robot specified by 6 numbers 6 degrees of freedom (DOF) 6 is the minimum number required to position end-effector arbitrarily. For dynamical systems, add velocity

More information

Simultaneous Localization and Mapping

Simultaneous Localization and Mapping Simultaneous Localization and Mapping Miroslav Kulich Intelligent and Mobile Robotics Group Czech Institute of Informatics, Robotics and Cybernetics Czech Technical University in Prague Winter semester

More information

Towards control over fading channels

Towards control over fading channels Towards control over fading channels Paolo Minero, Massimo Franceschetti Advanced Network Science University of California San Diego, CA, USA mail: {minero,massimo}@ucsd.edu Invited Paper) Subhrakanti

More information

SENSOR SCHEDULING AND EFFICIENT ALGORITHM IMPLEMENTATION FOR TARGET TRACKING. Amit Singh Chhetri

SENSOR SCHEDULING AND EFFICIENT ALGORITHM IMPLEMENTATION FOR TARGET TRACKING. Amit Singh Chhetri SENSOR SCHEDULING AND EFFICIENT ALGORITHM IMPLEMENTATION FOR TARGET TRACKING by Amit Singh Chhetri A Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy

More information

1 Kalman Filter Introduction

1 Kalman Filter Introduction 1 Kalman Filter Introduction You should first read Chapter 1 of Stochastic models, estimation, and control: Volume 1 by Peter S. Maybec (available here). 1.1 Explanation of Equations (1-3) and (1-4) Equation

More information

Decision Procedures An Algorithmic Point of View

Decision Procedures An Algorithmic Point of View An Algorithmic Point of View ILP References: Integer Programming / Laurence Wolsey Deciding ILPs with Branch & Bound Intro. To mathematical programming / Hillier, Lieberman Daniel Kroening and Ofer Strichman

More information

Provably Correct Persistent Surveillance for Unmanned Aerial Vehicles Subject to Charging Constraints

Provably Correct Persistent Surveillance for Unmanned Aerial Vehicles Subject to Charging Constraints Provably Correct Persistent Surveillance for Unmanned Aerial Vehicles Subject to Charging Constraints Kevin Leahy, Dingjiang Zhou, Cristian-Ioan Vasile, Konstantinos Oikonomopoulos, Mac Schwager, and Calin

More information

Lecture 10 Linear Quadratic Stochastic Control with Partial State Observation

Lecture 10 Linear Quadratic Stochastic Control with Partial State Observation EE363 Winter 2008-09 Lecture 10 Linear Quadratic Stochastic Control with Partial State Observation partially observed linear-quadratic stochastic control problem estimation-control separation principle

More information

IMPROVED MPC DESIGN BASED ON SATURATING CONTROL LAWS

IMPROVED MPC DESIGN BASED ON SATURATING CONTROL LAWS IMPROVED MPC DESIGN BASED ON SATURATING CONTROL LAWS D. Limon, J.M. Gomes da Silva Jr., T. Alamo and E.F. Camacho Dpto. de Ingenieria de Sistemas y Automática. Universidad de Sevilla Camino de los Descubrimientos

More information

Physics-Aware Informative Coverage Planning for Autonomous Vehicles

Physics-Aware Informative Coverage Planning for Autonomous Vehicles Physics-Aware Informative Coverage Planning for Autonomous Vehicles Michael J. Kuhlman 1, Student Member, IEEE, Petr Švec2, Member, IEEE, Krishnanand N. Kaipa 2, Member, IEEE, Donald Sofge 3, Member, IEEE,

More information

Optimal Temporal Logic Planning in Probabilistic Semantic Maps

Optimal Temporal Logic Planning in Probabilistic Semantic Maps Optimal Temporal Logic Planning in Probabilistic Semantic Maps Jie Fu, Nikolay Atanasov, Ufuk Topcu, and George J. Pappas Abstract This paper considers robot motion planning under temporal logic constraints

More information

CS Algorithms and Complexity

CS Algorithms and Complexity CS 50 - Algorithms and Complexity Linear Programming, the Simplex Method, and Hard Problems Sean Anderson 2/15/18 Portland State University Table of contents 1. The Simplex Method 2. The Graph Problem

More information

ECE276A: Sensing & Estimation in Robotics Lecture 10: Gaussian Mixture and Particle Filtering

ECE276A: Sensing & Estimation in Robotics Lecture 10: Gaussian Mixture and Particle Filtering ECE276A: Sensing & Estimation in Robotics Lecture 10: Gaussian Mixture and Particle Filtering Lecturer: Nikolay Atanasov: natanasov@ucsd.edu Teaching Assistants: Siwei Guo: s9guo@eng.ucsd.edu Anwesan Pal:

More information

Multi-Robot Active SLAM with Relative Entropy Optimization

Multi-Robot Active SLAM with Relative Entropy Optimization Multi-Robot Active SLAM with Relative Entropy Optimization Michail Kontitsis, Evangelos A. Theodorou and Emanuel Todorov 3 Abstract This paper presents a new approach for Active Simultaneous Localization

More information

Design of Norm-Optimal Iterative Learning Controllers: The Effect of an Iteration-Domain Kalman Filter for Disturbance Estimation

Design of Norm-Optimal Iterative Learning Controllers: The Effect of an Iteration-Domain Kalman Filter for Disturbance Estimation Design of Norm-Optimal Iterative Learning Controllers: The Effect of an Iteration-Domain Kalman Filter for Disturbance Estimation Nicolas Degen, Autonomous System Lab, ETH Zürich Angela P. Schoellig, University

More information

Discrete Double Integrator Consensus

Discrete Double Integrator Consensus Proceedings of the 47th IEEE Conference on Decision and Control Cancun, Mexico, Dec. 9-11, 28 Discrete Double Integrator Consensus David W. Casbeer, Randy Beard, and A. Lee Swindlehurst Abstract A distributed

More information

Lecture Notes: (Stochastic) Optimal Control

Lecture Notes: (Stochastic) Optimal Control Lecture Notes: (Stochastic) Optimal ontrol Marc Toussaint Machine Learning & Robotics group, TU erlin Franklinstr. 28/29, FR 6-9, 587 erlin, Germany July, 2 Disclaimer: These notes are not meant to be

More information

SLAM Techniques and Algorithms. Jack Collier. Canada. Recherche et développement pour la défense Canada. Defence Research and Development Canada

SLAM Techniques and Algorithms. Jack Collier. Canada. Recherche et développement pour la défense Canada. Defence Research and Development Canada SLAM Techniques and Algorithms Jack Collier Defence Research and Development Canada Recherche et développement pour la défense Canada Canada Goals What will we learn Gain an appreciation for what SLAM

More information

A Residual Gradient Fuzzy Reinforcement Learning Algorithm for Differential Games

A Residual Gradient Fuzzy Reinforcement Learning Algorithm for Differential Games International Journal of Fuzzy Systems manuscript (will be inserted by the editor) A Residual Gradient Fuzzy Reinforcement Learning Algorithm for Differential Games Mostafa D Awheda Howard M Schwartz Received:

More information

Bubble Clustering: Set Covering via Union of Ellipses

Bubble Clustering: Set Covering via Union of Ellipses Bubble Clustering: Set Covering via Union of Ellipses Matt Kraning, Arezou Keshavarz, Lei Zhao CS229, Stanford University Email: {mkraning,arezou,leiz}@stanford.edu Abstract We develop an algorithm called

More information

Decentralized Goal Assignment and Trajectory Generation in Multi-Robot Networks: A Multiple Lyapunov Functions Approach

Decentralized Goal Assignment and Trajectory Generation in Multi-Robot Networks: A Multiple Lyapunov Functions Approach Decentralized Goal Assignment and Trajectory Generation in Multi-Robot Networks: A Multiple Lyapunov Functions Approach Dimitra Panagou, Matthew Turpin and Vijay Kumar Abstract This paper considers the

More information

Vlad Estivill-Castro. Robots for People --- A project for intelligent integrated systems

Vlad Estivill-Castro. Robots for People --- A project for intelligent integrated systems 1 Vlad Estivill-Castro Robots for People --- A project for intelligent integrated systems V. Estivill-Castro 2 Probabilistic Map-based Localization (Kalman Filter) Chapter 5 (textbook) Based on textbook

More information

Target Localization and Circumnavigation Using Bearing Measurements in 2D

Target Localization and Circumnavigation Using Bearing Measurements in 2D Target Localization and Circumnavigation Using Bearing Measurements in D Mohammad Deghat, Iman Shames, Brian D. O. Anderson and Changbin Yu Abstract This paper considers the problem of localization and

More information

Estimation of State Noise for the Ensemble Kalman filter algorithm for 2D shallow water equations.

Estimation of State Noise for the Ensemble Kalman filter algorithm for 2D shallow water equations. Estimation of State Noise for the Ensemble Kalman filter algorithm for 2D shallow water equations. May 6, 2009 Motivation Constitutive Equations EnKF algorithm Some results Method Navier Stokes equations

More information

EXPERIMENTAL ANALYSIS OF COLLECTIVE CIRCULAR MOTION FOR MULTI-VEHICLE SYSTEMS. N. Ceccarelli, M. Di Marco, A. Garulli, A.

EXPERIMENTAL ANALYSIS OF COLLECTIVE CIRCULAR MOTION FOR MULTI-VEHICLE SYSTEMS. N. Ceccarelli, M. Di Marco, A. Garulli, A. EXPERIMENTAL ANALYSIS OF COLLECTIVE CIRCULAR MOTION FOR MULTI-VEHICLE SYSTEMS N. Ceccarelli, M. Di Marco, A. Garulli, A. Giannitrapani DII - Dipartimento di Ingegneria dell Informazione Università di Siena

More information

Online Passive-Aggressive Algorithms

Online Passive-Aggressive Algorithms Online Passive-Aggressive Algorithms Koby Crammer Ofer Dekel Shai Shalev-Shwartz Yoram Singer School of Computer Science & Engineering The Hebrew University, Jerusalem 91904, Israel {kobics,oferd,shais,singer}@cs.huji.ac.il

More information

Convergence Analysis of the Variance in. Gaussian Belief Propagation

Convergence Analysis of the Variance in. Gaussian Belief Propagation 0.09/TSP.204.2345635, IEEE Transactions on Signal Processing Convergence Analysis of the Variance in Gaussian Belief Propagation Qinliang Su and Yik-Chung Wu Abstract It is known that Gaussian belief propagation

More information

Learning Manipulation Patterns

Learning Manipulation Patterns Introduction Dynamic Movement Primitives Reinforcement Learning Neuroinformatics Group February 29, 2012 Introduction Dynamic Movement Primitives Reinforcement Learning Motivation shift to reactive motion

More information

Distributed Receding Horizon Control of Cost Coupled Systems

Distributed Receding Horizon Control of Cost Coupled Systems Distributed Receding Horizon Control of Cost Coupled Systems William B. Dunbar Abstract This paper considers the problem of distributed control of dynamically decoupled systems that are subject to decoupled

More information

Optimal Control of Multi-battery Energy-aware Systems

Optimal Control of Multi-battery Energy-aware Systems 5th IEEE Conference on Decision and Control and European Control Conference (CDC-ECC) Orlando, FL, USA, December -5, Optimal Control of Multi-battery Energy-aware Systems Tao Wang and Christos G. Cassandras

More information

ADMM and Fast Gradient Methods for Distributed Optimization

ADMM and Fast Gradient Methods for Distributed Optimization ADMM and Fast Gradient Methods for Distributed Optimization João Xavier Instituto Sistemas e Robótica (ISR), Instituto Superior Técnico (IST) European Control Conference, ECC 13 July 16, 013 Joint work

More information

Communication constraints and latency in Networked Control Systems

Communication constraints and latency in Networked Control Systems Communication constraints and latency in Networked Control Systems João P. Hespanha Center for Control Engineering and Computation University of California Santa Barbara In collaboration with Antonio Ortega

More information

A decentralized Polynomial based SLAM algorithm for a team of mobile robots

A decentralized Polynomial based SLAM algorithm for a team of mobile robots Preprints of the 19th World Congress The International Federation of Automatic Control A decentralized Polynomial based SLAM algorithm for a team of mobile robots Luigi D Alfonso, Antonio Grano, Pietro

More information

An Adaptive Clustering Method for Model-free Reinforcement Learning

An Adaptive Clustering Method for Model-free Reinforcement Learning An Adaptive Clustering Method for Model-free Reinforcement Learning Andreas Matt and Georg Regensburger Institute of Mathematics University of Innsbruck, Austria {andreas.matt, georg.regensburger}@uibk.ac.at

More information

A Guaranteed Cost LMI-Based Approach for Event-Triggered Average Consensus in Multi-Agent Networks

A Guaranteed Cost LMI-Based Approach for Event-Triggered Average Consensus in Multi-Agent Networks A Guaranteed Cost LMI-Based Approach for Event-Triggered Average Consensus in Multi-Agent Networks Amir Amini, Arash Mohammadi, Amir Asif Electrical and Computer Engineering,, Montreal, Canada. Concordia

More information

Mixed-Integer Nonlinear Programming Formulation of a UAV Path Optimization Problem

Mixed-Integer Nonlinear Programming Formulation of a UAV Path Optimization Problem Mixed-Integer Nonlinear Programming Formulation of a UAV Path Optimization Problem Shankarachary Ragi and Hans D. Mittelmann Abstract We present a mixed-integer nonlinear programming (MINLP) formulation

More information

Partially Observable Markov Decision Processes (POMDPs)

Partially Observable Markov Decision Processes (POMDPs) Partially Observable Markov Decision Processes (POMDPs) Geoff Hollinger Sequential Decision Making in Robotics Spring, 2011 *Some media from Reid Simmons, Trey Smith, Tony Cassandra, Michael Littman, and

More information

E190Q Lecture 11 Autonomous Robot Navigation

E190Q Lecture 11 Autonomous Robot Navigation E190Q Lecture 11 Autonomous Robot Navigation Instructor: Chris Clark Semester: Spring 013 1 Figures courtesy of Siegwart & Nourbakhsh Control Structures Planning Based Control Prior Knowledge Operator

More information

Reachability-based Adaptive UAV Scheduling and Planning in Cluttered and Dynamic Environments

Reachability-based Adaptive UAV Scheduling and Planning in Cluttered and Dynamic Environments Reachability-based Adaptive UAV Scheduling and Planning in Cluttered and Dynamic Environments Esen Yel and Nicola Bezzo Abstract Modern unmanned aerial vehicles (UAVs) rely on periodic sensor measurements

More information

L11. EKF SLAM: PART I. NA568 Mobile Robotics: Methods & Algorithms

L11. EKF SLAM: PART I. NA568 Mobile Robotics: Methods & Algorithms L11. EKF SLAM: PART I NA568 Mobile Robotics: Methods & Algorithms Today s Topic EKF Feature-Based SLAM State Representation Process / Observation Models Landmark Initialization Robot-Landmark Correlation

More information

A strongly polynomial algorithm for linear systems having a binary solution

A strongly polynomial algorithm for linear systems having a binary solution A strongly polynomial algorithm for linear systems having a binary solution Sergei Chubanov Institute of Information Systems at the University of Siegen, Germany e-mail: sergei.chubanov@uni-siegen.de 7th

More information

Self-triggered and team-triggered control of. networked cyber-physical systems

Self-triggered and team-triggered control of. networked cyber-physical systems Self-triggered and team-triggered control of 1 networked cyber-physical systems Cameron Nowzari Jorge Cortés I. INTRODUCTION This chapter describes triggered control approaches for the coordination of

More information

Mobile Robot Localization

Mobile Robot Localization Mobile Robot Localization 1 The Problem of Robot Localization Given a map of the environment, how can a robot determine its pose (planar coordinates + orientation)? Two sources of uncertainty: - observations

More information

Efficient Information Planning in Graphical Models

Efficient Information Planning in Graphical Models Efficient Information Planning in Graphical Models computational complexity considerations John Fisher & Giorgos Papachristoudis, MIT VITALITE Annual Review 2013 September 9, 2013 J. Fisher (VITALITE Annual

More information

Economic and Distributed Model Predictive Control: Recent Developments in Optimization-Based Control

Economic and Distributed Model Predictive Control: Recent Developments in Optimization-Based Control SICE Journal of Control, Measurement, and System Integration, Vol. 10, No. 2, pp. 039 052, March 2017 Economic and Distributed Model Predictive Control: Recent Developments in Optimization-Based Control

More information

Isobath following using an altimeter as a unique exteroceptive sensor

Isobath following using an altimeter as a unique exteroceptive sensor Isobath following using an altimeter as a unique exteroceptive sensor Luc Jaulin Lab-STICC, ENSTA Bretagne, Brest, France lucjaulin@gmail.com Abstract. We consider an underwater robot equipped with an

More information

MAE 598: Multi-Robot Systems Fall 2016

MAE 598: Multi-Robot Systems Fall 2016 MAE 598: Multi-Robot Systems Fall 2016 Instructor: Spring Berman spring.berman@asu.edu Assistant Professor, Mechanical and Aerospace Engineering Autonomous Collective Systems Laboratory http://faculty.engineering.asu.edu/acs/

More information

Coordinated Path Following for Mobile Robots

Coordinated Path Following for Mobile Robots Coordinated Path Following for Mobile Robots Kiattisin Kanjanawanishkul, Marius Hofmeister, and Andreas Zell University of Tübingen, Department of Computer Science, Sand 1, 7276 Tübingen Abstract. A control

More information

THIS work is concerned with motion planning in Dynamic,

THIS work is concerned with motion planning in Dynamic, 1 Robot Motion Planning in Dynamic, Uncertain Environments Noel E. Du Toit, Member, IEEE, and Joel W. Burdick, Member, IEEE, Abstract This paper presents a strategy for planning robot motions in dynamic,

More information

Passivity-based Formation Control for UAVs with a Suspended Load

Passivity-based Formation Control for UAVs with a Suspended Load Passivity-based Formation Control for UAVs with a Suspended Load Chris Meissen Kristian Klausen Murat Arcak Thor I. Fossen Andrew Packard Department of Mechanical Engineering at the University of California,

More information

Robust Adaptive Motion Planning in the Presence of Dynamic Obstacles

Robust Adaptive Motion Planning in the Presence of Dynamic Obstacles 2016 American Control Conference (ACC) Boston Marriott Copley Place July 6-8, 2016. Boston, MA, USA Robust Adaptive Motion Planning in the Presence of Dynamic Obstacles Nurali Virani and Minghui Zhu Abstract

More information

ESC794: Special Topics: Model Predictive Control

ESC794: Special Topics: Model Predictive Control ESC794: Special Topics: Model Predictive Control Discrete-Time Systems Hanz Richter, Professor Mechanical Engineering Department Cleveland State University Discrete-Time vs. Sampled-Data Systems A continuous-time

More information

Stochastic Target Interception in Non-convex Domain Using MILP

Stochastic Target Interception in Non-convex Domain Using MILP Stochastic Target Interception in Non-convex Domain Using MILP Apoorva Shende, Matthew J. Bays, and Daniel J. Stilwell Virginia Polytechnic Institute and State University Blacksburg, VA 24060 Email: {apoorva,

More information

Experimental validation of a decentralized control law for multi-vehicle collective motion

Experimental validation of a decentralized control law for multi-vehicle collective motion Experimental validation of a decentralized control law for multi-vehicle collective motion Daniele Benedettelli, Nicola Ceccarelli, Andrea Garulli, Antonio Giannitrapani Abstract The paper presents the

More information

arxiv: v1 [cs.sy] 4 Mar 2014

arxiv: v1 [cs.sy] 4 Mar 2014 A Potential Game Approach for Information-Maximizing Cooperative Planning of Sensor Networks Han-Lim Choi* and Su-Jin Lee arxiv:403.08v [cs.sy] 4 Mar 04 Abstract This paper presents a potential game approach

More information

Non-Collision Conditions in Multi-agent Robots Formation using Local Potential Functions

Non-Collision Conditions in Multi-agent Robots Formation using Local Potential Functions 2008 IEEE International Conference on Robotics and Automation Pasadena, CA, USA, May 19-23, 2008 Non-Collision Conditions in Multi-agent Robots Formation using Local Potential Functions E G Hernández-Martínez

More information

The Particle Filter. PD Dr. Rudolph Triebel Computer Vision Group. Machine Learning for Computer Vision

The Particle Filter. PD Dr. Rudolph Triebel Computer Vision Group. Machine Learning for Computer Vision The Particle Filter Non-parametric implementation of Bayes filter Represents the belief (posterior) random state samples. by a set of This representation is approximate. Can represent distributions that

More information

CMU Lecture 11: Markov Decision Processes II. Teacher: Gianni A. Di Caro

CMU Lecture 11: Markov Decision Processes II. Teacher: Gianni A. Di Caro CMU 15-781 Lecture 11: Markov Decision Processes II Teacher: Gianni A. Di Caro RECAP: DEFINING MDPS Markov decision processes: o Set of states S o Start state s 0 o Set of actions A o Transitions P(s s,a)

More information

Statistical Visual-Dynamic Model for Hand-Eye Coordination

Statistical Visual-Dynamic Model for Hand-Eye Coordination The 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems October 18-22, 2010, Taipei, Taiwan Statistical Visual-Dynamic Model for Hand-Eye Coordination Daniel Beale, Pejman Iravani

More information

Formation path following control of unicycle-type mobile robots

Formation path following control of unicycle-type mobile robots 8 IEEE International Conference on Robotics and Automation Pasadena, CA, USA, May 19-3, 8 Formation path following control of unicycle-type mobile robots Jawhar Ghommam, Maarouf Saad and Faïçal Mnif Abstract

More information

Decentralized Decision Making for Multiagent Systems

Decentralized Decision Making for Multiagent Systems 5 Decentralized Decision Making for Multiagent Systems George Mathews, Hugh Durrant-Whyte, and Mikhail Prokopenko 5.1 Introduction Decision making in large distributed multiagent systems is a difficult

More information

Distributed MAP probability estimation of dynamic systems with wireless sensor networks

Distributed MAP probability estimation of dynamic systems with wireless sensor networks Distributed MAP probability estimation of dynamic systems with wireless sensor networks Felicia Jakubiec, Alejandro Ribeiro Dept. of Electrical and Systems Engineering University of Pennsylvania https://fling.seas.upenn.edu/~life/wiki/

More information

Learning Dexterity Matthias Plappert SEPTEMBER 6, 2018

Learning Dexterity Matthias Plappert SEPTEMBER 6, 2018 Learning Dexterity Matthias Plappert SEPTEMBER 6, 2018 OpenAI OpenAI is a non-profit AI research company, discovering and enacting the path to safe artificial general intelligence. OpenAI OpenAI is a non-profit

More information

Vision-based range estimation via Immersion and Invariance for robot formation control

Vision-based range estimation via Immersion and Invariance for robot formation control Vision-based range estimation via Immersion and Invariance for robot formation control Fabio Morbidi, Gian Luca Mariottini, Domenico Prattichizzo Abstract The paper introduces a new vision-based range

More information

Distributed Optimization over Networks Gossip-Based Algorithms

Distributed Optimization over Networks Gossip-Based Algorithms Distributed Optimization over Networks Gossip-Based Algorithms Angelia Nedić angelia@illinois.edu ISE Department and Coordinated Science Laboratory University of Illinois at Urbana-Champaign Outline Random

More information

Weak Input-to-State Stability Properties for Navigation Function Based Controllers

Weak Input-to-State Stability Properties for Navigation Function Based Controllers University of Pennsylvania ScholarlyCommons Departmental Papers MEAM) Department of Mechanical Engineering & Applied Mechanics December 2006 Weak Input-to-State Stability Properties for Navigation Function

More information

Parameter Estimation in a Moving Horizon Perspective

Parameter Estimation in a Moving Horizon Perspective Parameter Estimation in a Moving Horizon Perspective State and Parameter Estimation in Dynamical Systems Reglerteknik, ISY, Linköpings Universitet State and Parameter Estimation in Dynamical Systems OUTLINE

More information

Adaptive Nonlinear Model Predictive Control with Suboptimality and Stability Guarantees

Adaptive Nonlinear Model Predictive Control with Suboptimality and Stability Guarantees Adaptive Nonlinear Model Predictive Control with Suboptimality and Stability Guarantees Pontus Giselsson Department of Automatic Control LTH Lund University Box 118, SE-221 00 Lund, Sweden pontusg@control.lth.se

More information

CMU Lecture 4: Informed Search. Teacher: Gianni A. Di Caro

CMU Lecture 4: Informed Search. Teacher: Gianni A. Di Caro CMU 15-781 Lecture 4: Informed Search Teacher: Gianni A. Di Caro UNINFORMED VS. INFORMED Uninformed Can only generate successors and distinguish goals from non-goals Informed Strategies that can distinguish

More information

An efficient approach to stochastic optimal control. Bert Kappen SNN Radboud University Nijmegen the Netherlands

An efficient approach to stochastic optimal control. Bert Kappen SNN Radboud University Nijmegen the Netherlands An efficient approach to stochastic optimal control Bert Kappen SNN Radboud University Nijmegen the Netherlands Bert Kappen Examples of control tasks Motor control Bert Kappen Pascal workshop, 27-29 May

More information

A Hierarchy of Suboptimal Policies for the Multi-period, Multi-echelon, Robust Inventory Problem

A Hierarchy of Suboptimal Policies for the Multi-period, Multi-echelon, Robust Inventory Problem A Hierarchy of Suboptimal Policies for the Multi-period, Multi-echelon, Robust Inventory Problem Dimitris J. Bertsimas Dan A. Iancu Pablo A. Parrilo Sloan School of Management and Operations Research Center,

More information

Distributed Optimization. Song Chong EE, KAIST

Distributed Optimization. Song Chong EE, KAIST Distributed Optimization Song Chong EE, KAIST songchong@kaist.edu Dynamic Programming for Path Planning A path-planning problem consists of a weighted directed graph with a set of n nodes N, directed links

More information

Convex receding horizon control in non-gaussian belief space

Convex receding horizon control in non-gaussian belief space Convex receding horizon control in non-gaussian belief space Robert Platt Jr. Abstract One of the main challenges in solving partially observable control problems is planning in high-dimensional belief

More information

Implan: Scalable Incremental Motion Planning for Multi-Robot Systems

Implan: Scalable Incremental Motion Planning for Multi-Robot Systems Implan: Scalable Incremental Motion Planning for Multi-Robot Systems Indranil Saha UC Berkeley and UPenn Joint work with Rattanachai Ramaithitima (UPenn), Vijay Kumar (UPenn), George Pappas (UPenn) and

More information

Sparse Covariance Selection using Semidefinite Programming

Sparse Covariance Selection using Semidefinite Programming Sparse Covariance Selection using Semidefinite Programming A. d Aspremont ORFE, Princeton University Joint work with O. Banerjee, L. El Ghaoui & G. Natsoulis, U.C. Berkeley & Iconix Pharmaceuticals Support

More information

Temporal-Difference Q-learning in Active Fault Diagnosis

Temporal-Difference Q-learning in Active Fault Diagnosis Temporal-Difference Q-learning in Active Fault Diagnosis Jan Škach 1 Ivo Punčochář 1 Frank L. Lewis 2 1 Identification and Decision Making Research Group (IDM) European Centre of Excellence - NTIS University

More information

MDP Preliminaries. Nan Jiang. February 10, 2019

MDP Preliminaries. Nan Jiang. February 10, 2019 MDP Preliminaries Nan Jiang February 10, 2019 1 Markov Decision Processes In reinforcement learning, the interactions between the agent and the environment are often described by a Markov Decision Process

More information

Motion Planning for LTL Specifications: A Satisfiability Modulo Convex Optimization Approach

Motion Planning for LTL Specifications: A Satisfiability Modulo Convex Optimization Approach Motion Planning for LTL Specifications: A Satisfiability Modulo Convex Optimization Approach Yasser Shoukry UC Berkeley, UCLA, and UPenn Joint work with Pierluigi Nuzzo (UC Berkeley), Indranil Saha (IIT

More information

An Optimization-based Approach to Decentralized Assignability

An Optimization-based Approach to Decentralized Assignability 2016 American Control Conference (ACC) Boston Marriott Copley Place July 6-8, 2016 Boston, MA, USA An Optimization-based Approach to Decentralized Assignability Alborz Alavian and Michael Rotkowitz Abstract

More information

QUALIFYING EXAM IN SYSTEMS ENGINEERING

QUALIFYING EXAM IN SYSTEMS ENGINEERING QUALIFYING EXAM IN SYSTEMS ENGINEERING Written Exam: MAY 23, 2017, 9:00AM to 1:00PM, EMB 105 Oral Exam: May 25 or 26, 2017 Time/Location TBA (~1 hour per student) CLOSED BOOK, NO CHEAT SHEETS BASIC SCIENTIFIC

More information

Resilient Submodular Maximization For Control And Sensing

Resilient Submodular Maximization For Control And Sensing University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2018 Resilient Submodular Maximization For Control And Sensing Vasileios Tzoumas University of Pennsylvania, vdtzoumas@gmail.com

More information

STATE ESTIMATION IN COORDINATED CONTROL WITH A NON-STANDARD INFORMATION ARCHITECTURE. Jun Yan, Keunmo Kang, and Robert Bitmead

STATE ESTIMATION IN COORDINATED CONTROL WITH A NON-STANDARD INFORMATION ARCHITECTURE. Jun Yan, Keunmo Kang, and Robert Bitmead STATE ESTIMATION IN COORDINATED CONTROL WITH A NON-STANDARD INFORMATION ARCHITECTURE Jun Yan, Keunmo Kang, and Robert Bitmead Department of Mechanical & Aerospace Engineering University of California San

More information