Planning & Decision-making in Robotics Search Algorithms: Markov Property, Dependent vs. Independent variables, Dominance relationship

Size: px

Start display at page:

Download "Planning & Decision-making in Robotics Search Algorithms: Markov Property, Dependent vs. Independent variables, Dominance relationship"

Christian Wilkins
5 years ago
Views:

1 6-782 Planning & Decision-making in Robotics Search Algorithms: Markov Property, Dependent vs. Independent variables, Dominance relationship Maxim Likhachev Robotics Institute Carnegie Mellon University

2 Consider Planning with Battery Constraint - want a collision-free path to s goal = (x goal, y goal ) - want to minimize some cost function associated with each transition (for example, minimize the risk of flying close to people) - subject to the trajectory being feasible given the UAV battery level L What should be the variables defining each state (i.e., dimensions of the search)? Carnegie Mellon University 2

3 Consider Planning with Battery Constraint x start,y start,l start =L S start - want a collision-free path to s goal = (x goal, y goal ) - want to minimize some cost function associated with each transition (for example, minimize the risk of flying close to people) - subject to the trajectory being feasible given the UAV battery level L - Planning needs to be in (x,y,l), where l is the remaining battery level x,y,l =l start - S x 2,y 2,l 2 =l start - x i,y i,l i =0 Carnegie Mellon University 3 S i states with battery level 0 have no successors

4 Markov Property Cost and Set of Successors needs to depend ONLY on the current state (no dependence on the history of the path leading up to it!) c(s 0,s ) S for all states s: succ(s) = function of s for all s in succ(s): c(s,s )= function of s, s S 0 c(s 0,s 2 ) Carnegie Mellon University 4

5 Markov Property Cost and Set of Successors needs to depend ONLY on the current state (no dependence on the history of the path leading up to it!) c(s 0,s ) S for all states s: succ(s) = function of s for all s in succ(s): c(s,s )= function of s, s S 0 c(s 0,s 2 ) Clearly true in an explicit (given) graph Can be violated in implicit (dynamically generated) graphs, where succ(s) and c(s,s ) are computed on-the-fly as a function of s, when using dependent variables Carnegie Mellon University 5

6 Independent vs. Dependent Variables X(s) variables associated with s X(s) = {X ind (s), X dep (s)} X ind (s) independent variables X dep (s) dependent variables X dep (s) = end-effector pose X ind (s) =joint angles Independent Variables are used to define state s two states s and s are considered to be the same state if and only if X ind (s) = X ind (s ) Dependent Variables often used to help with computing cost or list of successor states if for all s, X dep (s) = f(x ind (s)) (that is, only depends on independent variables, then Markov Property holds true) Often however, developers suggest to compute X dep (s) based on the path leading up to X ind (s) Carnegie Mellon University 6

7 Consider Planning with Battery Constraint - want a collision-free path to s goal = (x goal, y goal ) - want to minimize some cost function associated with each transition (for example, minimize the risk of flying close to people) - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level Carnegie Mellon University 7

8 Consider Planning with Battery Constraint x start,y start,l start =L S start - want a collision-free path to s goal = (x goal, y goal ) - want to minimize some cost function associated with each transition (for example, minimize the risk of flying close to people) - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level 20 x,y,l =l start - 2 S x 2,y 2,l 2 =l start - x 3,y 3,l 3 =l start -2 S x 4,y 4,l 4 =? S 4 S goal What is l(s 4 ) at the time s 4 gets expanded? What does it break? Carnegie Mellon University 8

9 Consider Planning with Battery Constraint x start,y start,l start =L S start - want a collision-free path to s goal = (x goal, y goal ) - want to minimize some cost function associated with each transition (for example, minimize the risk of flying close to people) - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level 20 x,y,l =l start - 2 S x 2,y 2,l 2 =l start - x 3,y 3,l 3 =l start -2 S x 4,y 4,l 4 =? S 4 S goal How does this graph look if X ind = (x,y,l)? Carnegie Mellon University 9

10 Consider Planning with Constraints on Rate of Turning Suppose we are planning 2D (x,y) path for a ground robot and constraining its heading to change by at most 45 degrees at each timestep based on the previous transition - Consider X ind =(x,y), X dep =(Ѳ), where Ѳ is robot s heading Example of incompleteness? Carnegie Mellon University 0

11 Consider Planning with Continuous (x,y,ѳ) Suppose we are planning 3D (x,y,ѳ) path for a ground robot but we don t have motion primitives (lattice) that move the robot exactly between the centers of 3D cells - Consider X ind =(x disc,y disc,ѳ disc ), X dep =(x cont,y cont,ѳ cont ), where X dep keeps track of the continuous robot pose along its path [Barraquand, J. & Latombe, 93] Example of incompleteness? Carnegie Mellon University

12 Consider Planning in Dynamic Environments Suppose we are planning a path among moving obstacles - want a collision-free path to s goal - want to minimize some cost function associated with each transition - Consider X ind =(robot pose), X dep =(t), where t is time Example of incompleteness? Carnegie Mellon University 2

13 Consider Planning in Dynamic Environments Suppose we are planning a path among moving obstacles - want a collision-free path to s goal - assume cost function is time - Consider X ind =(robot pose), X dep =(t), where t is time Is it incomplete? Carnegie Mellon University 3

14 Back to Planning with Battery Constraint - want a collision-free path to s goal = (x goal, y goal ) - assume cost function is battery consumption - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level Is it incomplete? Carnegie Mellon University 4

15 Back to Planning with Battery Constraint - want a collision-free path to s goal = (x goal, y goal ) - assume cost function is battery consumption - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level x start,y start,l start =L S start x,y,l =l start - x 3,y 3,l 3 =l start -2 x 4,y 4,l 4 =? S S 3 x 2,y 2,l 2 =l start - S 4 S goal Carnegie Mellon University 5

16 Back to Planning with Battery Constraint - want a collision-free path to s goal = (x goal, y goal ) - assume cost function is battery consumption - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level x start,y start,l start =L S start x,y,l =l start - x 3,y 3,l 3 =l start -2 x 4,y 4,l 4 =l start -2 S x 2,y 2,l 2 =l start - S 3 S 4 In fact, don t even need to have variable l since it is the same as g(s)! You can just plan in (x,y) S goal Carnegie Mellon University 6

17 Back to Planning with Battery Constraint - want a collision-free path to s goal = (x goal, y goal ) - assume cost function is battery consumption - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level x start,y start,l start =L S start x,y,l =l start - S x 2,y 2,l 2 =l start - x 3,y 3,l 3 =l start -2 S 3 x 4,y 4,l 4 =l start -2 S 4 In general, when is it OK to use X dep (s) in determining succ(s) and edgecosts? S goal Carnegie Mellon University 7

18 Back to Planning with Battery Constraint x start,y start,l start =L S start - want a collision-free path to s goal = (x goal, y goal ) - assume cost function is battery consumption - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level Whenever you can guarantee that for any state s: if we have two paths π (s start,s) and π 2 (s start,s) s.t. c(π ) c(π 2 ), then it implies that c (s,s ) c 2 (s,s ), x where,y,l =l c i (s,s ) start - cost x 3,y of 3,l a 3 =l least-cost start -2 path x 4,y 4 from,l 4 =l start s to -2 s after s is reached from s S S start via path π 3 S i 4 x 2,y 2,l 2 =l start - In general, when is it OK to use X dep (s) in determining succ(s) and edgecosts? S goal Carnegie Mellon University 8

19 Back to Planning with Battery Constraint x start,y start,l start =L S start - want a collision-free Assuming we path are to running s goal = (x optimal goal, y goal search ) - assume cost function is battery (such as consumption A*). - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level Whenever you can guarantee that for any state s: if we have two paths π (s start,s) and π 2 (s start,s) s.t. c(π ) c(π 2 ), then it implies that c (s,s ) c 2 (s,s ), x where,y,l =l c i (s,s ) start - cost x 3,y of 3,l a 3 =l least-cost start -2 path x 4,y 4 from,l 4 =l start s to -2 s after s is reached from s S S start via path π 3 S i 4 x 2,y 2,l 2 =l start - In general, when is it OK to use X dep (s) in determining succ(s) and edgecosts? S goal Carnegie Mellon University 9

20 Back to Planning with Battery Constraint Suppose we are What planning happens 2D if we (x,y) are running path for UAV x start,y start,l start =L S start - want a collision-free Assuming we path are to running s goal = (x optimal goal, y goal search ) - assume cost function is battery (such as consumption A*). - subject to the trajectory being feasible given the UAV battery level L - Consider X ind =(x,y), X dep =(l), where l is the remaining battery level Whenever you can guarantee that for any state s: if we have two paths π (s start,s) and π 2 (s start,s) s.t. c(π ) c(π 2 ), then it implies that c (s,s ) c 2 (s,s ), x where,y,l =l c i (s,s ) start - cost x 3,y of 3,l a 3 =l least-cost start -2 path x 4,y 4 from,l 4 =l start s to -2 s after s is reached from s S S start via path π 3 S i 4 x 2,y 2,l 2 =l start - suboptimal search such as weighted A*? In general, when is it OK to use X dep (s) in determining succ(s) and edgecosts? S goal Carnegie Mellon University 20

21 Dominance Relationship x start,y start,l start =L S start - want a collision-free path to s goal = (x goal, y goal ) - want to minimize some cost function associated with each transition (for example, minimize the risk of flying close to people) - subject What to the are trajectory the general being conditions feasible given for pruning the UAV dominated battery level states? L - Consider X ind =(x,y,l) x,y,l =l start - 2 S x 2,y 2,l 2 =l start - x 3,y 3,l 3 =l start -2 S x 4,y 4,l 4 =? S 4 Do we really need two copies of s 4? One with l 4 =l start -2 and one with l 4 =l start -3 S goal Carnegie Mellon University 2

22 Dominance Relationship x start,y start,l start =L S start - want a collision-free path to s goal = (x goal, y goal ) - want to minimize some cost function associated with each transition (for example, minimize the risk of flying close to people) - subject What to the are trajectory the general being conditions feasible given for pruning the UAV dominated battery level states? L - Consider X ind =(x,y,l) if (g(s) g(s ) and s dominates s ), then s can be pruned by search s dominates s implies s cannot be part of a solution that is better than the solution from s x,y,l =l start - 2 S x 2,y 2,l 2 =l start - x 3,y 3,l 3 =l start -2 S x 4,y 4,l 4 =? S 4 Do we really need two copies of s 4? One with l 4 =l start -2 and one with l 4 =l start -3 S goal Carnegie Mellon University 22

23 A* Search with Dominance Check Main function g(s start ) = 0; all other g-values are infinite; OPEN = {s start }; ComputePath(); publish solution; ComputePath function while(s goal is not expanded and OPEN 0) remove s with the smallest [f(s) = g(s)+h(s)] from OPEN; insert s into CLOSED; for every successor s of s such that s not in CLOSED if g(s ) > g(s) + c(s,s ) g(s ) = g(s) + c(s,s ); if there exists state s such that (g(s ) g(s ) AND s dominates s ) continue; //skip inserting state s into OPEN, i.e., prune insert s into OPEN; Carnegie Mellon University 23

24 What You Should Know Dependent vs. Independent variables. Definition of Markov Property The definition and the use of Dominance relationship Carnegie Mellon University 24

Incremental Path Planning

Incremental Path Planning Dynamic and Incremental A* Prof. Brian Williams (help from Ihsiang Shu) 6./6.8 Cognitive Robotics February 7 th, Outline Review: Optimal Path Planning in Partially Known Environments.