论文标题
L4KDE:运动动力学扩展的学习
L4KDE: Learning for KinoDynamic Tree Expansion
论文作者
论文摘要
我们介绍了运动动力学计划的运动动力学扩展(L4KDE)方法。基于树木的计划方法,例如快速探索随机树(RRT),是在连续的状态空间计划中找到全球最佳计划的主要方法。这些方法的核心是树膨胀,这是将新节点添加到不断扩展的树中的过程。我们研究了基于树木计划的运动动力学变体,我们知道系统动力学和运动学约束。为了快速选择节点以连接新采样的坐标,现有方法通常无法优化以找到过渡到采样坐标的成本较低的节点。取而代之的是,他们使用坐标之间的欧几里得距离之类的指标作为选择候选节点以连接到搜索树的启发式。我们建议L4KDE解决这个问题。 L4KDE使用神经网络来预测查询状态之间的过渡成本,该状态可以有效地分批计算,而与常用的启发式方法相比,在维持几乎差异的渐近优化保证的同时,提供了更高的过渡成本质量估计。我们从经验上证明了L4KDE在各种具有挑战性的系统动力学方面提供的显着性能提高,并能够在同一模型类别的不同实例上概括,并与一组现代化的基于树的运动计划者一起进行。
We present the Learning for KinoDynamic Tree Expansion (L4KDE) method for kinodynamic planning. Tree-based planning approaches, such as rapidly exploring random tree (RRT), are the dominant approach to finding globally optimal plans in continuous state-space motion planning. Central to these approaches is tree-expansion, the procedure in which new nodes are added into an ever-expanding tree. We study the kinodynamic variants of tree-based planning, where we have known system dynamics and kinematic constraints. In the interest of quickly selecting nodes to connect newly sampled coordinates, existing methods typically cannot optimise to find nodes that have low cost to transition to sampled coordinates. Instead, they use metrics like Euclidean distance between coordinates as a heuristic for selecting candidate nodes to connect to the search tree. We propose L4KDE to address this issue. L4KDE uses a neural network to predict transition costs between queried states, which can be efficiently computed in batch, providing much higher quality estimates of transition cost compared to commonly used heuristics while maintaining almost-surely asymptotic optimality guarantee. We empirically demonstrate the significant performance improvement provided by L4KDE on a variety of challenging system dynamics, with the ability to generalise across different instances of the same model class, and in conjunction with a suite of modern tree-based motion planners.