Integrated Task & Motion Planning for a Simplified Mars Rover Exploration Problem

Cognitive Robotics Spring 2022 Final Project - Alex Furman & Yotam Granov

(0) Introduction

Abstract

In this work, we implement task and motion planning methods in order to solve a simplified exploration problem for a hypothetical Mars rover called Ignorance. The rover is tasked with autonomously navigating across the surface of Mars to multiple goal zones without hitting any obstacles, and we must ensure that it reaches all goal zones assigned to it. At each goal zone, the rover will extract a geological sample from a cave at that location (which it must first map out), and once it has retrieved samples from every goal zone it must return to its base with the samples.

To solve this problem, we sampled a large number of points in the rover's configuration space (C-space) and implemented the Probabilistic Roadmap (PRM) algorithm in order to build a graph connecting the sampled points to their k-nearest neighbors. A single point is sampled from within each goal zone, which we'll call a goal node, and every goal node will be connected to the PRM graph (along with the location of the base, which we treat as the start node). We then apply Dijkstra's shortest path algorithm between the start node and every goal node, as well as between every pair of goal nodes, in order to construct a simple weighted graph that represents the total distance between each crucial node (the start and goal nodes). We treat this graph as a weighted Travelling Salesman Problem (without the limitation that each node can only be visited once), and solve it using the Optimal Fast Downward classical planner in order to obtain the final path for Ignorance. In order to effectively integrate the task and motion planning here, we utilize the Unified Planning Framework library for Python, which allows us to easily encode classical planning problems into PDDL with geometric constraints obtained from the motion planner (which is implemented completely in Python).

Finally, at each goal point we show the motion planning procedure for how the rover's arm will retrieve the geological sample without colliding with any of the cave's walls, and we do so using the Rapidly Exploring Random Trees (RRT) algorithm. We also implement the optimized RRT algorithm (known as RRT*) for this same purpose, and compare the efficiency of the two algorithms when applied to our problem.

Running the Code

In order to run the code, simply run the command python Ignorance.py from the command line with any of the following command line arguments:

  -h          Show this help message and exit
  -nav        Run the navigation simulation (default: True)
  -arm        Run the arm simulation (default: True)
  -n          Number of goals to create (default: 3)
  -o          Number of obstacles to create (default will randomize the number of obstacles) (default: None)
  -s          Number of samples for PRM (default: 200)
  -a          Show animations (default: False)
  -k          Show the k-nearest neighbors animation (default: False)
  -p          Save .pddl files (default: False)
  -i          Save .png image files (default: False)
  -g          Save .gif animation files (default: False)

Dependencies: Python 3, numpy, matplotlib, os, imageio, shutil, networkx, copy, math, random, sys, scipy, warnings, unified_planning (version=0.4.2.362.dev1), unified_planning[fast-downward], argparse

(1) Rover Navigation

In the first part of the problem, we want to plan the path of the rover between its starting point and a variety of goal zones (by default there are 3 such zones), all of which are randomly generated at the start of a run. There are a variety of obstacles (represented as circles of varying sizes) which are also randomly generated across the map - the number of obstacles can either be stochastic (between 5 and 20) or determined by the user. The rover, which is treated as a point for the sake of simplicity, must find a path that begins at the start node (represented as a single point), passes through each of the goal zones (represented as circles of random radii) at least once, and terminates to the start node - all without colliding with any obstacles.

Initial Map

First, the initial map of the environment for the navigation problem is produced. It is shown in the following figure:

The rover's start point is represented by the blue point, the goal zones are the green circles, and the obstacles are the dark orange circles. The free space of the rover is colored light orange.

Probabilistic Roadmap (PRM)

Next, we sample a certain number of points (the default is 200) from the rover's workspace, including one point per goal zone (this becomes the goal node that will be added to the roadmap, and they are represented in dark green). We use a k-Nearest Neighbors (kNN) algorithm (the default k is 5) each sample's nearest neighbors (i.e. create an edge between them) using the kd-tree method, and we can see the results of this in the following animation:

All of the sampled points in the free space are initially shown in black, and the node that is cuurently being assesses for its k-nearest neighbors is highlighted yellow. These neighbors are then highlighted in orange (apologies for the color clashing). Once we have the neighbors identified for every single sample, we connected the samples to their neighbors (these connections are represented using edges) and then run a collision detection algorithm to check whether any of the produced edges collide with any of the obstacles.

In the following animation, we see the initial sampling process, where the black points represent the sample nodes that were produced (using uniform random sampling), and then the production of valid edges using the kNN and collision detection algorithms:

The yellow points represent nodes which have been connected to their neighbors, and the black lines between the yellow points represent the appropriate edges. The start and goal nodes are highlighted once they are connected to the roadmpa. Once the edge production process is completed, we are left with a graph that should, in most cases (this is a probabilistic process after all), connect the start node to all of the goal nodes and most of the goal nodes to each other, and thus we obtain our Probabilistic Roadmap (PRM) $^{[7]}$.

Dijkstra's Algorithm

Once we have our PRM, we would like to find the shortest paths connecting each goal node to the start node, as well as those connecting the goal nodes to each other (when possible). Our PRM is a weighted undirected graph, where the weights are determined by the Euclidean distance between each pair of nodes (i.e. the length of the edge connecting them), and so we can implement Dijkstra's shortest path algorithm $^{[6]}$ to get these shortest paths. The following animation shows the paths obtained by Dijkstra's algorithm, highlighted in yellow:

Travelling Salesman Problem

Next, we simplify the paths obtained by the Dijkstra algorithm into a simple weighted undirected graph, exemplified in the figure below:

We then treat this graph as a weighted Travelling Salesman Problem (TSP), but one where each goal node in the graph must be visited *at least* once (and not only once, as is usually constrained for TSP's), and we can solve this using classical planning methods by representing the distances between nodes as the action cost for moving between those nodes. Our classical planning representation (which we do in PDDL using the Unified Planning Framework $^{[1]}$ for Python) is conducted as follows:

Domain

There is a rover type and a location type, as well as a start type which extends location as well as a set of goal types (goal{i}, where $i$ is a natural number) unique to each goal node (this was part of a messy workaround that we needed to implement) which also extend the location type
The fluents (which are boolean, i.e. predicates) indicate the current location (node) of the rover (called rover-at(?rover, ?location)), as well as whether or not the rover has already visited a certain node (called visited ?rover ?location)
A numeric fluent called total-cost records the sum of the costs of the actions in the plan - we will ask our planner to minimize this value in its solution
The possible actions are encoded as a set of actions called move_{i}_{j}, which take the rover from node $i$ to node $j$ - the reason we didn't just create one move action for all nodes is because we needed to encode the action costs (given by the length of the shortest path between those nodes) in a compatible way for the UPF (as part of our messy workaround)
- move{i}_{j}(?rover ?i ?j) has precondition rover_at(?rover ?i) and effects $\lnot$(rover_at ?rover ?i)$\land$(rover_at ?rover ?j)$\land$(visited ?rover ?j)$\land$(increase (total-cost) {cost{i}{j}})))

Problem

We instantiate one object per type (one rover of type rover, one start point of type start, and a set of start points g{i} of type goal{i})
We set the initial state such that the rover is at the start point, the rover has visited the start point, and the total cost is zero - i.e. (rover_at rover s) (visited rover s) (= (total-cost) 0))
We set the goal state such that the rover is back at the start point and that all of the goal points have been visited - i.e. (rover_at ?rover ?s)$\land$(visited ?rover ?g{j})$\forall g_{j}\in{goals}$
Finally, we declare the quality metric to be the minimization of the total cost of the actions in the plan (i.e. the total-cost fluent)

An example of these .pddl files can be found in the Graphics folder. Once the problem is represented in PDDL, we use the Fast Downward Planner $^{[4]}$ (with optimality guarantees) in order to obtain the optimal solution to our TSP. Using the plan it gives, we can finally tell our rover what trajectory to take in order to conduct its navigation and complete its tasks.

Final Trajectory

(2) Arm Motion Planning

Once we reach a location of interest, our mars rover neeeds to retrieve the geological sample, without colliding with the obstacles around it. Thus it was decided to use RRT to map out a trajectory for the rover's robot arm, given a known workspace.

Rapidly Exploring Random Trees (RRT)

Definitions

The RRT algorithm works by interacting with 2 spaces: the Configuration Space (C-Space), and the Workspace (W-Space). Given a robot arm with n joints, the C-Space is a set of dimension $R^n$, which contains all possible configurations of the robot's joints. For example, given a simple R-R robot arm whose motors can each make one full revolution, the dimension of the C-Space is $R^2$, with the bounds $([0, 2\pi),[0, 2\pi))$. The W-Space contains the physical representation of the robot, the goal, and all obstacles. The dimension of the W-Space in our case is simplified to $R^2$. (Note that the RRT algorithm still works for higher dimensions of C-Space, and W-Sapce - however runtimes will inevitably increase)

The Algorithm

The RRT algorithm works by expanding a tree of configuration nodes over the configuration space (C-Space), using random sampling. Our tree starts with a single root configuration in the C-Space (This represents the initial joint values of the robot). From there, a random configuration is sampled, and the tree locates the node in the tree which is nearest to the random configuration. From there a new node is created in the direction of the random node, a step-size away from the nearest node. The forward kinematics of the robot arm is run with this new configuration, and the robot queries whether it has collided with its environment. If it has, the node is discarded. If it has not colllided, the node is added to the tree, with the nearest node as its parent. We repeat this process until we find a configuration which causes the robot arm to reach the goal. The trajectory to the goal can now be extracted by following the path in the tree from the initial node to the goal-node

Notes on the Algorithm

It is important to choose the right step-size. Too big a step-size causes a large jump in the robot arm's configuration in the workspace - meaning a collision may have occured while moving from one configuration to another. Too small a step-size will cause very long run-times.

There are many variations on the RRT algorithm, one specifically worth mentioning here is RRT*. RRT* is RRT with 'rewiring' capabilities. After adding some node to the tree, the node looks for its closest neighbours within a specified radius. Once it finds these neighbours, it checks if connecting to the closest neighbour would result in collision. If it does, it moves on to the next closest neighbour. If there is no collision, the node removes its previous parent, and is adopted by this closest node. This results in a shorter trajectory the robot arm needs to execute. In fact, RRT* ensures an asymptotically optimal solution $^{[5]}$.

Search in C-Space

Final Arm Trajectory

References

[1] AIPlan4EU. "The Unified Planning Library", 2021. GitHub: https://github.com/aiplan4eu/unified-planning

[2] Wenjun Cheng & Yuhui Gao. "Using PDDL to Solve Vehicle Routing Problems". 8th International Conference on Intelligent Information Processing (IIP), 2014.

[3] Tim Chinenov. "Robotic Path Planning: RRT and RRT*". 2019.

[4] Malte Helmert. "The Fast Downward Planning System". Journal of Artificial Intelligence Research (JAIR), 2006. GitHub: https://github.com/aibasel/downward

[5] Sertac Karaman & Emilio Frazzoli. "Sampling-Based Algorithms for Optimal Motion Planning". The International Journal of Robotics Research (IJRR), 2011.

[6] Alexey Klochay. "Implementing Dijkstra’s Algorithm in Python". Udacity, 2021.

[7] Atsushi Sakai, Daniel Ingram, Joseph Dinius, Karan Chawla, Antonin Raffin & Alexis Paques. "PythonRobotics: A Python Code Collection of Robotics Algorithms". 2018. GitHub: https://github.com/AtsushiSakai/PythonRobotics

[8] Oren Salzman. "Sampling-Based Planners" Lecture, Algorithmic Robot Motion Planning Course (236610): Technion - Israel Institute of Technology, 2021.

Name		Name	Last commit message	Last commit date
Latest commit History 88 Commits
Arm_Modules		Arm_Modules
Graphics		Graphics
Literature		Literature
Nav_Modules		Nav_Modules
.gitignore		.gitignore
Arm_Run.py		Arm_Run.py
Ignorance.py		Ignorance.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Integrated Task & Motion Planning for a Simplified Mars Rover Exploration Problem

Cognitive Robotics Spring 2022 Final Project - Alex Furman & Yotam Granov

(0) Introduction

Abstract

Running the Code

(1) Rover Navigation

Initial Map

Probabilistic Roadmap (PRM)

Dijkstra's Algorithm

Travelling Salesman Problem

Domain

Problem

Final Trajectory

(2) Arm Motion Planning

Rapidly Exploring Random Trees (RRT)

Definitions

The Algorithm

Notes on the Algorithm

Search in C-Space

Final Arm Trajectory

References

About

Releases

Packages

Contributors 2

Languages

AlexanderFurman/Cognitive-Robotics-Project

Folders and files

Latest commit

History

Repository files navigation

Integrated Task & Motion Planning for a Simplified Mars Rover Exploration Problem

Cognitive Robotics Spring 2022 Final Project - Alex Furman & Yotam Granov

(0) Introduction

Abstract

Running the Code

(1) Rover Navigation

Initial Map

Probabilistic Roadmap (PRM)

Dijkstra's Algorithm

Travelling Salesman Problem

Domain

Problem

Final Trajectory

(2) Arm Motion Planning

Rapidly Exploring Random Trees (RRT)

Definitions

The Algorithm

Notes on the Algorithm

Search in C-Space

Final Arm Trajectory

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages