Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning
Open Access
- 9 October 2019
- journal article
- research article
- Published by MDPI AG in Applied Sciences
- Vol. 9 (20), 4198
- https://doi.org/10.3390/app9204198
Abstract
Compared with a single-robot system, a multi-robot system offers higher efficiency and fault tolerance, and it has great potential in application scenarios such as robot search, rescue, and escort tasks. Deep reinforcement learning provides a potential framework for multi-robot formation and collaborative navigation. This paper studies the collaborative formation and navigation of multiple robots using a deep reinforcement learning algorithm. The proposed method improves the classical Deep Deterministic Policy Gradient (DDPG) algorithm to address the single-robot mapless navigation task. We also extend the single-robot DDPG algorithm to the multi-robot system, obtaining the Parallel Deep Deterministic Policy Gradient (PDDPG) algorithm. Using a 2D lidar sensor, the group of robots can accomplish the formation construction task and the collaborative formation navigation task. Experimental results on the Gazebo simulation platform illustrate that our method is capable of guiding mobile robots to construct a formation and keep it during group navigation, directly from raw lidar data inputs.
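For readers unfamiliar with the classical DDPG algorithm that the abstract takes as its starting point, the following is a minimal sketch of two of its standard ingredients: the bootstrapped critic target and the soft (Polyak) update of target-network weights. It does not reproduce the paper's improved DDPG or PDDPG, whose details are not given in the abstract. The function names, the flat-list weight representation, and the default values of `gamma` and `tau` are illustrative assumptions.

```python
from typing import List


def td_target(reward: float, next_q: float, done: bool, gamma: float = 0.99) -> float:
    """Critic target y = r + gamma * Q'(s', mu'(s')).

    next_q stands for the target critic's value at the next state and the
    target actor's action; no bootstrapping is applied at terminal states.
    """
    return reward + (0.0 if done else gamma * next_q)


def soft_update(target: List[float], online: List[float], tau: float = 0.001) -> List[float]:
    """Polyak averaging: theta_target <- tau * theta_online + (1 - tau) * theta_target.

    Weights are represented as flat lists purely for illustration (assumption).
    """
    return [tau * w + (1.0 - tau) * wt for w, wt in zip(online, target)]
```

In a full implementation these two steps would sit inside a training loop with a replay buffer, an actor network, and a critic network; those parts are omitted here because the abstract does not specify them.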