CRH*: A Deadlock Free Framework for Scalable Prioritised Path Planning in Multi-robot Systems

Heselden, James R.; Das, Gautham P.

doi:10.1007/978-3-030-89177-0_7

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13054))

Included in the following conference series:

Annual Conference Towards Autonomous Robotic Systems

1522 Accesses
1 Citations

Abstract

Multi-robot system is an ever growing tool which is able to be applied to a wide range of industries to improve productivity and robustness, especially when tasks are distributed in space, time and functionality. Recent works have shown the benefits of multi-robot systems in fields such as warehouse automation, entertainment and agriculture. The work presented in this paper tackles the deadlock problem in multi-robot navigation, in which robots within a common work-space, are caught in situations where they are unable to navigate to their targets, being blocked by one another. This problem can be mitigated by efficient multi-robot path planning. Our work focused around the development of a scalable rescheduling algorithm named Conflict Resolution Heuristic A* (CRH*) (https://github.com/iranaphor/crh_star) for decoupled prioritised planning. Extensive experimental evaluation of CRH* was carried out in discrete event simulations of a fleet of autonomous agricultural robots. The results from these experiments proved that the algorithm was both scalable and deadlock-free. Additionally, novel customisation options were included to test further optimisations in system performance. Continuous Assignment and Dynamic Scoring showed to reduce the make-span of the routing whilst Combinatorial Heuristics showed to reduce the impact of outliers on priority orderings.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Graph-Based Multi-Robot Path Finding and Planning

Article 16 June 2022

Dynamic multi-robot task allocation under uncertainty and temporal constraints

Article 06 November 2021

A Comparison of Two Decoupled Methods for Simultaneous Multiple Robots Path Planning

Keywords

1 Introduction

Autonomous mobile robotic technologies have matured over recent decades enabling their uses in many real-world applications. This has resulted in a push towards scaling up systems to large mobile robot fleets to improve operational efficiency, especially when tasks are inherently distributed in space, time or functionality. One of the primary requirements to ensure proper coordination among a fleet of autonomous mobile robots in a shared-workspace, is efficient path planning and allocation. Without effective coordination, interference between robots can have a detrimental effect on the fleet operations. Congestion-free route planning and coordination is needed to address this interference.

In Multi-Robot Path Planning (MRPP), there are four main challenges to overcome: Completeness, is the guarantee that a route will be found if one exists; Optimality, is the guarantee to find a set of routes minimising some metric such as make-span (time the last robot arrives) or flow-time (sum of all route lengths) [19]; Deadlocking, is where a robot will be in a state preventing another from reaching a target [5, 8]; and Scalability, is the issue of processing resources thinning as the complexity of the joint state-space increases. Among these, optimality and scalability can be considered mutually exclusive, and are tackled well by Coupled and Decoupled approaches respectively.

An example of a scenario where MRPP can be useful is a fleet of agricultural robots deployed in a polytunnel environment to execute a series of in-field logistics tasks [7]. These tasks are dynamic in nature and are dispersed over the environment. This paper addresses the MRPP problem to enable congestion and deadlock-free movement of robots in such a polytunnel environment, by proposing a novel algorithm called Conflict Resolution Heuristic A* (CRH*). By relying on decentralised and decoupled route planning to find sub-optimal solutions, the CRH* algorithm is complete and scalable, as demonstrated by empirical evaluations.

The rest of this paper is organised as follows: Sect. 2 provides an overview of related work in Prioritised Planning; Sect. 3 provides an overview of the implementation; Sect. 4 details the experimental setup, evaluation and results; and Sect. 5 concludes our findings.

2 Related Works

Decentralised and decoupled MRPP approaches are widely preferred to address the high computational complexity and low scalability of centralised approaches. Prioritised Planning (PP) is a decoupled MRPP approach which assigns priorities to each robot and then plans a route for each one sequentially, treating all prior robots as dynamic obstacles.

As the complexity of the joint state-space is proportional to the total configuration of priorities, finding the optimal assignment of priorities is a NP-Hard problem. Thus a full state-space search is unfeasible and smarter approaches must be taken [2]. When routes are unavailable due to blocking from higher priority agents, the robot can be considered to be deadlocked. Rescheduling can be used to optimise priority assignments and replan routes. CRH* tackles the area of rescheduling, making use of enhanced reservation tables and comprehensive replanning in a distributed decision-making topology.

Our approach focuses in the topological domain, in which the map is a finite set of discrete positions (nodes) and connections between them (edges) indicating possible paths from one node to another [10]. It is used as the common discretised environment representation over which all robots plan their routes. Such representations have low planning complexity and can easily detect possible conflicts compared to metric maps.

2.1 Heuristics

The effective use of heuristics works with the intent to give priority to the robot which needs it the most. In Static Ordering (SO) [9] scoring is made using the order the agents were added to the network, and in Euclidean Distance (ED) [17] scoring is based on the distance to the target.

These work well in metric maps which are very open and well-connected, however for agricultural environments in the scale of hectares, these are not viable as they are. Polytunnels and fields often include long isolated paths to traverse, which are better managed by approaches such as below, which follow the idea that agents struggle due to environmental constraints. ED can be adapted into Optimal Path Length (OPL) to become viable in these scenarios.

In Planning Time (PT) [18], the processing time to find a route is used for scoring, however this falls short in heterogeneous systems where platforms have differing processing hardware. In Naive & Coupled Surroundings [6, 19] agents are scored based on the cluttering of their workspace by counting the number of local obstacles. Where in Naive Surroundings (NS), obstacles are regarded as distinct and in Coupled Surroundings (CS) they are treated as effective, where effective obstacles are regions the agent is unable to navigate. In Path Prospects (PaP) [19] scoring is made based on the number of effective obstacles between the start and target to score on the number of paths available in the homology class of trajectories.

In PP, it is standard to use a single heuristic to assign priorities. We extend on this, exploring the potential of combining multiple heuristics to improve to handling of outliers demonstrated in Fig. 1.

2.2 Rescheduling

Rescheduling works to reduce the impact of deadlocks by modifying the priority schema to optimise generated orderings [9]. In Random rescheduling [4] priorities are randomised and replanned whenever a deadlock occurs. The Hill-Climbing search [3] extends on this by randomly swapping pairs of priorities. Continuous Enhancement [15] allows agents to modify their own score if they are unable to find a route. This is extended by Deterministic Rescheduling [1], to award the maximum priority, following the idea that issues are caused by their local environment. Local Priority Assignment [14] works by exploring all local priority configurations in proximity to an agent unable to find a route. Priority Tuning [13] optimises successful assignments by shuffling priorities between the least optimal agents, repeating until convergence. In our approach, we extend the ideas in Deterministic Rescheduling by integrating the intelligent overriding of higher priorities.

2.3 Path Finding

Path Finding algorithms are a core component within prioritised planning, being used to generate routes once priorities are assigned. As detailed in [11, 16], there are many categories of path finding algorithms. Being one of the most fundamental path finding algorithms, A* [12] works only to identify a route. Local Repair A* (LRA*) [20], recalculates the remainder of its route when a collision is pre-empted. Cooperative A* (CA*) [16], uses three dimensional space-time reservation tables to mark off impassable regions. Our approach includes an extension to the reservation tables of CA* to override existing reservations; and an extension to LRA* to optimise replanning from a point of conflict.

3 Design and Development

3.1 Overview

The main architecture of CRH* (Fig. 2), works in three stages, with the final two repeating together till convergence. In the first stage, the robot is assigned a target, plans a route, and informs the coordinator of its reservations. The coordinator, on receiving the route, adds the reservations to the global map, notifying any other robots of overturned reservations in the second stage. The robots each receive their failed reservations (FR) in the third stage, and perform replanning to identify new routes, passing these new routes to the coordinator.

Each message contains a list of reservations, of which each consist of an edge identifier, agent identifier, and a reservation start and end time.

On receiving an FR, the shortest of three replanning methods is returned to the coordinator. These three methods are: Replanning from Start where a new route is generated from scratch; Replanning from Conflict where the route beyond FR is replanned; and Replanning with Delay which adds a time delay (in which the robot will wait to use the FR once it is available) and replans from there.

3.2 A* Adaptations

There were two key modifications to the motion planning algorithm to enable the enhancements for our approach, the first was the reservation override system (CRH Battle), the second was the deadlock management (FAILED List).

The CRH Battle was included within A* where as each edge is initially investigated, a score (referred to as the CRH score) is generated to query against any existing reservations in the time period specified. Only if the robot’s CRH score is greater, will the override work. While this scoring system can utilise any single heuristic such as described in Sect. 2.1, it is also capable of taking the combination of multiple heuristics (e.g. ED, \(\sum (ED,OPL)\), or \(\prod (PT,NS,CS)\)). For our experiments, we utilise a variety of scoring systems as the heuristics themselves are arbitrary to the aim of the research.

The deadlock management is implemented in the form of an additional flag for exploration. A* has two flags, OPEN and CLOSED, which define whether a node is on the frontier for exploration or has already been explored, where if the OPEN list is exhausted, the planning fails. In our approach, when reservation overrides fail, the edge is added to a new FAILED list, which is accessed once the OPEN list is exhausted to obtain the most optimal edge to override, boosting the CRH Score generated.

3.3 Framework Facilities

Continuous Assignment. In PP, the handling of new assignments is done in one of two ways, in Batch Assignment replanning will wait till all have completed routes, whilst in Full Replanning all agents replan in the current state. In our approach, we propose a third method Continuous Assignment as a direct contrast to Batch Assignment in which rescheduling is applied without replanning the entire network, only updating affected agents. This allows for a reduction in make-span as shown in Fig. 3.

Dynamic Scoring. The standard method of score generation is Static, in which scores for each agent are assigned before routing. In our approach, we implement the concept of dynamic scoring, where every reservation uses locally relevant information to compute heuristics. As the routing gets closer to a target, each edge will be reserved with information relevant to that edge rather then information from the agents current location, for example with ED, each edge will be reserved with its distance to the target rather the distance to the target from the start node as would happen with static scoring.

4 Experiments

Discrete Event Simulation. To ensure a comprehensive evaluation, Discrete Event Simulation (DES) was utilised along with Monte-Carlo simulations, enabled through the deterministic nature of the approach. In this, time jumps to points of interest (POI) rather than evaluating every timestep. In our experiments, these POI consist of any timestep in which a route is completed; time jumps to the next route completion, performs any calculations, and repeats.

Experiment Maps. Experimentation was completed on two topological maps Fork (Fig. 4a) and Riseholme (Fig. 4b), representing a single polytunnel and a pair of polytunnels respectively. The experimentation was completed on the Fork map for the majority of experiments, with the Riseholme map used also for the scalability test.

Heuristics. In the following experiments, used to generate the CRH scores for resolving conflicts are the heuristics of: Optimal Path Length (OPL), Euclidean Distance (ED), Planning Time (PT), and Static Ordering (SO) of which we utilise the Agent’s ID.

4.1 Evaluation of CRH*

Deadlocks. The FAILED list, is used to handle every deadlock encountered, so by recording the activity of the FAILED list, we are able to identify how often potential deadlock situations occur. Figure 5 shows the average number of deadlocks encountered per target over 50 runs with 1000 targets per run. These results show on average 0.47 deadlocks for each given target, which if not managed as they are here, would cause many delays in navigation.

Scalability. As the approach is decentralised in design, path planning is expected to run distributed, thus does not contribute to scalability concerns. The area of highest concern is thus communication, which consists of one message per replan to the coordinator. So to evaluate scalability, the Replans per Target (RT) is recorded, which accurately measures the load on the coordinator. The results in Fig. 6 show upward trends initially before plateauing, avoiding exponential scaling and thus can be regarded as scalable.

4.2 Evaluation of Optimisation Improvements

Batch vs. Continuous Assignment. Continuous Assignment works to improve make-span by reducing idle time for agents waiting to replan. Figure 7a shows the resulting make-span distributions contrasting Batch and Continuous Assignment. The tests were run five times with 20 targets for each of 5, 10 and 20 agents using OPL as the CRH score to resolve conflicts. The results show Continuous Assignment improves make-span in both small and large simulations. This is evidenced by Fig. 7b showing the same tests run for 1000 targets with make-span reductions of 47%, 38%, and 20% respectively to the number of agents.

Static vs. Dynamic Scoring. To test the efficacy of the optimisation, the decoupled optimal make-span is negated from the CRH* make-span to get the worst-case delay. This delay was recorded across two sets of experiments, the first testing static scoring, and the second with dynamic scoring. This was repeated across six categories using the heuristics of ED, OPL and PT, for each of 50 and 100 targets per agent. The results in Fig. 8 show Dynamic Scoring offers a significant improvement, with the margin between Static and Dynamic Scoring growing larger proportionally to the total targets.

Base vs. Combinatorial Heuristics. To evaluate the efficacy of combinatorial heuristics, 10 tests were performed with 10 agents each, recording the average number of replans per run. Each test consisted of 10 runs with 20 targets, completed for each heuristic independently then combined together. This is repeated three times, the first comparing ED, OPL and PT; the second using SO and OPL; and the third comparing SO and ED. From Fig. 9, the average range of outliers is reduced when using the combination of all heuristics as opposed to purely independent heuristics. In addition, the average number of replans decreases for combined heuristics when using a combination of heuristic and Agent ID.

5 Conclusion

In this work, we have proposed the CRH* algorithm, a decentralised and decoupled enhancement to prioritised planning which aimed to improve deadlock avoidance with the use of reservation tables and deterministic rescheduling. We show through experimental evaluation that our approach is both scalable and deadlock-free as the complexity of the joint state-space increases with the size of the topological map and the number of agents.

We have also shown, clear make-span reductions of up to 47% with the use of Continuous Assignment and up to 82% with Dynamic Scoring. We have also shown prioritisation quality improvements with Combinatorial Heuristics reducing the impact of extreme outliers.

Due to the agnostic nature of the approach, the discrete event simulation and specific heuristics chosen are arbitrary to the evaluation of the developed features, however further work will include exploration with a wider sample of heuristics, and performance evaluations beyond discrete event simulation.

References

Andreychuk, A., Yakovlev, K.: Two techniques that enhance the performance of multi-robot prioritized path planning. In: Proceedings of International Joint Conference Autonomous agents and multiagent systems, AAMAS, vol. 3, pp. 2177–2179 (2018). ISSN 15582914. arXiv: 1805.01270
Azarm, K., Schmidt, G.: Conflict-free motion of multiple mobile robots based on decentralized motion planning and negotiation. In: Proceedings of International Conference on Robotics and Automation, vol. 4, pp. 3526–3533. IEEE (1997)
Google Scholar
Bennewitz, M., Burgard, W., Thrun, S.: Finding and optimizing solvable priority schemes for decoupled path planning techniques for teams of mobile robots. Rob. Auton. Syst. 41(2–3), 89–99 (2002)
Article Google Scholar
Bennewitz, M., Burgard, W., Thrun, S.: Optimizing schedules for prioritized path planning of multi-robot systems. In: IEEE International Conference on Robotics and Automation, vol. 1, pp. 271–276 (2001)
Google Scholar
Cap, M., et al.: Prioritized planning algorithms for trajectory coordination of multiple mobile robots. IEEE Trans. Autom. Sci. Eng. 12(3), 835–849 (2015). ISSN: 15455955. arXiv: 1409.2399
Clark, C.M., Bretl, T., Rock, S.: Applying kinodynamic randomized motion planning with a dynamic priority system to multi-robot space systems. In: Proceedings, IEEE Aerospace Conference, vol. 7, p. 7. IEEE (2002)
Google Scholar
Das, G.P., et al.: Discrete event simulations for scalability analysis of robotic in-field logistics in agriculture - a case study. In: ICRA 2018 International Conference on Robotics and Automation, Workshop on Robotic Vision and Action in Agriculture, Brisbane (2018)
Google Scholar
Dewangan, R.K., Shukla, A., Godfrey, W.W.: Survey on prioritized multi robot path planning. In: 2017 IEEE International Conference on Smart Technologies and Management for Computing, Communication, Controls, Energy And Materials, pp. 423–428. IEEE (2017)
Google Scholar
Erdmann, M., Lozano-Pérez, T.: On multiple moving objects. Algorithmica 2(1–4), 477–521 (1987)
Article MathSciNet Google Scholar
Fentanes, J.P., et al.: Now or later? Predicting and maximising success of navigation actions from long-term experience. In: Proceedings - IEEE International Conference on Robotics and Automation, June, Seattle, USA, vol. 2015, pp. 1112–1117 (2015)
Google Scholar
Ferguson, D., Likhachev, M., Stentz, A.: A guide to heuristic-based path planning. In: Proceedings International Workshop on Planning Under Uncertainty for Autonomous Systems, International Conference on Automated Planning and Scheduling, pp. 1–10 (2005)
Google Scholar
Hart, P.E., Nilsson, N.J., Raphael, B.: A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Syst. Sci. Cybern. 4(2), 100–107 (1968)
Article Google Scholar
Hu, B., Cao, Z.: Minimizing task completion time of prioritized motion planning in multi-robot systems. In: IEEE International Conference on Systems, Man and Cybernetics, vol. 2019, pp. 1018–1023. IEEE (2019)
Google Scholar
Ma, H., et al.: Searching with consistent prioritization for multi-agent path finding. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 7643–7650 (2019). ISSN: 2159–5399. arXiv: 1812.06356
Regele, R., Levi, P.: Cooperative multi-robot path planning by heuristic priority adjustment. In: 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 5954–5959. IEEE (2006)
Google Scholar
Silver, D.: Cooperative Pathfinding. Aiide 1, 117–122 (2005)
Google Scholar
Van Den Berg, J.P., Overmars, M.H.: Prioritized motion planning for multiple robots. In: 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS, pp. 2217–2222 (2005)
Google Scholar
Velagapudi, P., Sycara, K., Scerri, P.: Decentralized prioritized planning in large multirobot teams. In: IEEE/RSJ 2010 International Conference on Intelligent Robots and Systems, pp. 4603–4609 (2010)
Google Scholar
Wu, W., Bhattacharya, S., Prorok, A.: Multi-robot path deconfliction through prioritization by path prospects, pp. 22–24 (2019). arXiv: 1908.02361
Zelinsky, A.: A mobile robot navigation exploration algorithm. IEEE Trans. Robot. Autom. 8(6), 707–717 (1992)
Article Google Scholar

Download references

Author information

Authors and Affiliations

University of Lincoln, Lincoln, Lincolnshire, UK
James R. Heselden & Gautham P. Das

Authors

James R. Heselden
View author publications
You can also search for this author in PubMed Google Scholar
Gautham P. Das
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to James R. Heselden or Gautham P. Das .

Editor information

Editors and Affiliations

University of Lincoln, Lincoln, UK
Charles Fox
University of Lincoln, Lincoln, UK
Junfeng Gao
University of Lincoln, Lincoln, UK
Amir Ghalamzan Esfahani
University of Lincoln, Lincoln, UK
Mini Saaj
University of Lincoln, Lincoln, UK
Marc Hanheide
University of Lincoln, Lincoln, UK
Simon Parsons

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Heselden, J.R., Das, G.P. (2021). CRH*: A Deadlock Free Framework for Scalable Prioritised Path Planning in Multi-robot Systems. In: Fox, C., Gao, J., Ghalamzan Esfahani, A., Saaj, M., Hanheide, M., Parsons, S. (eds) Towards Autonomous Robotic Systems. TAROS 2021. Lecture Notes in Computer Science(), vol 13054. Springer, Cham. https://doi.org/10.1007/978-3-030-89177-0_7

Download citation

DOI: https://doi.org/10.1007/978-3-030-89177-0_7
Published: 31 October 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89176-3
Online ISBN: 978-3-030-89177-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics