论文标题
使用强化学习在受约束的行人环境中的机器人导航
Robot Navigation in Constrained Pedestrian Environments using Reinforcement Learning
论文作者
论文摘要
流畅地围绕行人导航是部署在人类环境(例如建筑物和房屋)中的移动机器人的必要能力。尽管对社会导航的研究主要集中在开放空间中行人数量的可伸缩性上,但典型的室内环境提出了受约束空间(例如走廊和门口)的额外挑战,例如限制可操作性和影响行人互动模式的挑战。我们提出了一种基于强化学习(RL)的方法,以学习能够动态适应移动行人的情况的政策,同时在受约束环境中在所需位置之间导航。策略网络从运动计划者那里获得指导,该计划者提供了遵循全球计划的轨迹的航路点,而RL处理本地交互。我们探索了多层训练的组成原理,并发现在一系列几何简单的布局中训练的政策成功地将其概括为更复杂的看不见的布局,这些布局表现出训练过程中可用的结构元素的组成。超越了像域一样超越墙壁世界,我们展示了学习策略的转移,以看不见两个真实环境的3D重建。这些结果支持构图原理在现实世界中导航的适用性,并指示在重建环境中对涉及交互的任务的多代理模拟使用有前途的使用。
Navigating fluently around pedestrians is a necessary capability for mobile robots deployed in human environments, such as buildings and homes. While research on social navigation has focused mainly on the scalability with the number of pedestrians in open spaces, typical indoor environments present the additional challenge of constrained spaces such as corridors and doorways that limit maneuverability and influence patterns of pedestrian interaction. We present an approach based on reinforcement learning (RL) to learn policies capable of dynamic adaptation to the presence of moving pedestrians while navigating between desired locations in constrained environments. The policy network receives guidance from a motion planner that provides waypoints to follow a globally planned trajectory, whereas RL handles the local interactions. We explore a compositional principle for multi-layout training and find that policies trained in a small set of geometrically simple layouts successfully generalize to more complex unseen layouts that exhibit composition of the structural elements available during training. Going beyond walls-world like domains, we show transfer of the learned policy to unseen 3D reconstructions of two real environments. These results support the applicability of the compositional principle to navigation in real-world buildings and indicate promising usage of multi-agent simulation within reconstructed environments for tasks that involve interaction.