在3D点云中对场景流的层次关注学习

论文标题

在3D点云中对场景流的层次关注学习

Hierarchical Attention Learning of Scene Flow in 3D Point Clouds

论文作者

Wang, Guangming, Wu, Xinrui, Liu, Zhe, Wang, Hesheng

论文摘要

场景流表示动态环境中每个点的3D运动。就像代表2D图像中像素运动的光流一样，场景流的3D运动表示有益于许多应用，例如自主驾驶和服务机器人。本文研究了两个连续3D点云的场景流估计问题。在本文中，提出了一个具有双重注意的新型层次神经网络，以了解相邻帧中点特征的相关性，并逐层从粗糙的层到细小的场景流动。拟议的网络具有新的无效分层体系结构。更不可能的意味着输入点的数量大于场景流估计的输出点的数量，这带来了更多的输入信息，并平衡了精度和资源消耗。在这个层次结构中，分别生成和监督不同级别的场景流。引入了一种新颖的细心嵌入模块，以使用双重注意方法以斑块的方式使用双重注意方法来汇总相邻点的特征。在我们的网络设计中仔细考虑了用于流动嵌入和流量监督的适当层。实验表明，所提出的网络的表现优于Flaythings3d和Kitti场景流2015年数据集的3D场景流量估算的最新性能。我们还将提出的网络应用于现实的LiDAR射仪任务，这是自动驾驶中的关键问题。实验结果表明，我们提出的网络可以胜过基于ICP的方法，并显示出良好的实际应用能力。

Scene flow represents the 3D motion of every point in the dynamic environments. Like the optical flow that represents the motion of pixels in 2D images, 3D motion representation of scene flow benefits many applications, such as autonomous driving and service robot. This paper studies the problem of scene flow estimation from two consecutive 3D point clouds. In this paper, a novel hierarchical neural network with double attention is proposed for learning the correlation of point features in adjacent frames and refining scene flow from coarse to fine layer by layer. The proposed network has a new more-for-less hierarchical architecture. The more-for-less means that the number of input points is greater than the number of output points for scene flow estimation, which brings more input information and balances the precision and resource consumption. In this hierarchical architecture, scene flow of different levels is generated and supervised respectively. A novel attentive embedding module is introduced to aggregate the features of adjacent points using a double attention method in a patch-to-patch manner. The proper layers for flow embedding and flow supervision are carefully considered in our network designment. Experiments show that the proposed network outperforms the state-of-the-art performance of 3D scene flow estimation on the FlyingThings3D and KITTI Scene Flow 2015 datasets. We also apply the proposed network to realistic LiDAR odometry task, which is an key problem in autonomous driving. The experiment results demonstrate that our proposed network can outperform the ICP-based method and shows the good practical application ability.

下载PDF全文

下载文献需遵守相关版权规定

论文标题