Socially compliant mobile robot navigation via inverse reinforcement learning

Top Cited Papers

11 July 2016

journal article
research article
Published by SAGE Publications in The International Journal of Robotics Research

Vol. 35 (11), 1289-1307
https://doi.org/10.1177/0278364915619772

Abstract

Mobile robots are increasingly populating our human environments. To interact with humans in a socially compliant way, these robots need to understand and comply with mutually accepted rules. In this paper, we present a novel approach to model the cooperative navigation behavior of humans. We model their behavior in terms of a mixture distribution that captures both the discrete navigation decisions, such as going left or going right, as well as the natural variance of human trajectories. Our approach learns the model parameters of this distribution that match, in expectation, the observed behavior in terms of user-defined features. To compute the feature expectations over the resulting high-dimensional continuous distributions, we use Hamiltonian Markov chain Monte Carlo sampling. Furthermore, we rely on a Voronoi graph of the environment to efficiently explore the space of trajectories from the robot’s current position to its target position. Using the proposed model, our method is able to imitate the behavior of pedestrians or, alternatively, to replicate a specific behavior that was taught by tele-operation in the target environment of the robot. We implemented our approach on a real mobile robot and demonstrated that it is able to successfully navigate in an office environment in the presence of humans. An extensive set of experiments suggests that our technique outperforms state-of-the-art methods to model the behavior of pedestrians, which also makes it applicable to fields such as behavioral science or computer graphics.

Keywords

This publication has 45 references indexed in Scilit:

Topological constraints in search-based robot path planning
Autonomous Robots, 2012
From human to humanoid locomotion—an inverse optimal control approach
Autonomous Robots, 2009
A survey of robot learning from demonstration
Robotics and Autonomous Systems, 2008
The formation of trajectories during goal‐oriented locomotion in humans. II. A maximum smoothness model
European Journal of Neuroscience, 2007
Sensor-Based Exploration: The Hierarchical Generalized Voronoi Graph
The International Journal of Robotics Research, 2000
The dynamic window approach to collision avoidance
IEEE Robotics & Automation Magazine, 1997
Voronoi diagrams—a survey of a fundamental geometric data structure
ACM Computing Surveys, 1991
Hybrid Monte Carlo
Physics Letters B, 1987
An efficient algorithm for K shortest simple paths
Networks, 1982
A note on two problems in connexion with graphs
Numerische Mathematik, 1959

Cited by 294 articles