Socially compliant mobile robot navigation via inverse reinforcement learning
- 11 July 2016
- journal article
- research article
- Published by SAGE Publications in The International Journal of Robotics Research
- Vol. 35 (11), 1289-1307
- https://doi.org/10.1177/0278364915619772
Abstract
Mobile robots are increasingly populating our human environments. To interact with humans in a socially compliant way, these robots need to understand and comply with mutually accepted rules. In this paper, we present a novel approach to model the cooperative navigation behavior of humans. We model their behavior in terms of a mixture distribution that captures both the discrete navigation decisions, such as going left or going right, as well as the natural variance of human trajectories. Our approach learns the model parameters of this distribution that match, in expectation, the observed behavior in terms of user-defined features. To compute the feature expectations over the resulting high-dimensional continuous distributions, we use Hamiltonian Markov chain Monte Carlo sampling. Furthermore, we rely on a Voronoi graph of the environment to efficiently explore the space of trajectories from the robot’s current position to its target position. Using the proposed model, our method is able to imitate the behavior of pedestrians or, alternatively, to replicate a specific behavior that was taught by tele-operation in the target environment of the robot. We implemented our approach on a real mobile robot and demonstrated that it is able to successfully navigate in an office environment in the presence of humans. An extensive set of experiments suggests that our technique outperforms state-of-the-art methods to model the behavior of pedestrians, which also makes it applicable to fields such as behavioral science or computer graphics.
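The feature-matching idea in the abstract can be illustrated with a toy sketch. It is not the paper's implementation: it assumes a one-dimensional state x instead of a trajectory, hypothetical features f(x) = (x, x^2), and a distribution of the form p(x) ∝ exp(-θ·f(x)). Hamiltonian Monte Carlo estimates the feature expectations under the current weights, and the difference to the demonstrated feature values gives the direction in which the weights would be adjusted. All function names and parameter values below are illustrative assumptions.

```python
import math
import random


def energy(x, theta):
    # Energy U(x) = theta . f(x) with assumed toy features f(x) = (x, x^2).
    return theta[0] * x + theta[1] * x * x


def grad_energy(x, theta):
    # Derivative dU/dx, needed by the leapfrog integrator.
    return theta[0] + 2.0 * theta[1] * x


def hmc_feature_expectations(theta, n_samples=20000, burn_in=500,
                             step=0.2, n_leapfrog=5, seed=0):
    """Estimate E_p[f(x)] for p(x) proportional to exp(-U(x)) with HMC."""
    rng = random.Random(seed)
    x = 0.0
    sums = [0.0, 0.0]
    for it in range(n_samples + burn_in):
        p = rng.gauss(0.0, 1.0)          # resample the momentum
        x_new, p_new = x, p
        # Leapfrog integration: half step in p, full steps in x and p,
        # then a closing half step in p.
        p_new -= 0.5 * step * grad_energy(x_new, theta)
        for i in range(n_leapfrog):
            x_new += step * p_new
            if i < n_leapfrog - 1:
                p_new -= step * grad_energy(x_new, theta)
        p_new -= 0.5 * step * grad_energy(x_new, theta)
        # Metropolis correction on the total energy (Hamiltonian).
        h_old = energy(x, theta) + 0.5 * p * p
        h_new = energy(x_new, theta) + 0.5 * p_new * p_new
        if rng.random() < math.exp(min(0.0, h_old - h_new)):
            x = x_new
        if it >= burn_in:
            sums[0] += x
            sums[1] += x * x
    return [s / n_samples for s in sums]


def feature_matching_gradient(theta, demo_features):
    # Gradient of the demonstration log-likelihood with respect to theta:
    # expected features under the model minus demonstrated features.
    expected = hmc_feature_expectations(theta)
    return [e - d for e, d in zip(expected, demo_features)]


if __name__ == "__main__":
    theta = [-2.0, 1.0]                  # assumed weights: Gaussian, mean 1, variance 0.5
    print(hmc_feature_expectations(theta))
    print(feature_matching_gradient(theta, [1.0, 1.5]))
```

With the assumed weights the target is a Gaussian with mean 1 and variance 0.5, so the estimates should be close to E[x] = 1 and E[x^2] = 1.5, and the gradient should be close to zero when the demonstrated features take those values. The trajectory features, the mixture over discrete navigation decisions, and the Voronoi-graph exploration described in the abstract are outside the scope of this sketch.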