论文标题
通过自我监督的关联网络多对象跟踪
Multi-object tracking with self-supervised associating network
论文作者
论文摘要
多对象跟踪(MOT)是具有很大发展潜力的任务,并且仍然有许多问题需要解决。在传统的检测范式跟踪中,基于功能的对象重新识别方法有很多工作。但是,此方法缺乏培训数据问题。为了标记多目标跟踪数据集,视频序列中的每个检测都需要其位置和ID。由于将连续的ID分配给每个序列中的每个检测是一项非常劳动密集的任务,因此当前的多目标跟踪数据集不足以训练重新识别网络。因此,在本文中,我们使用许多没有人类标签的简短视频提出了一种新颖的自我监督学习方法,并通过以自我监督的方式培训的重新识别网络来改善跟踪性能,以解决缺乏培训数据问题。尽管重新识别网络以自我监督的方式进行了培训,但它仍可以在MOT17测试基准上实现MOTA 62.0 \%和IDF1 62.6 \%的最先进性能。此外,通过大量数据所学的性能和学习的能力一样多,它显示了自我监督方法的潜力。
Multi-Object Tracking (MOT) is the task that has a lot of potential for development, and there are still many problems to be solved. In the traditional tracking by detection paradigm, There has been a lot of work on feature based object re-identification methods. However, this method has a lack of training data problem. For labeling multi-object tracking dataset, every detection in a video sequence need its location and IDs. Since assigning consecutive IDs to each detection in every sequence is a very labor-intensive task, current multi-object tracking dataset is not sufficient enough to train re-identification network. So in this paper, we propose a novel self-supervised learning method using a lot of short videos which has no human labeling, and improve the tracking performance through the re-identification network trained in the self-supervised manner to solve the lack of training data problem. Despite the re-identification network is trained in a self-supervised manner, it achieves the state-of-the-art performance of MOTA 62.0\% and IDF1 62.6\% on the MOT17 test benchmark. Furthermore, the performance is improved as much as learned with a large amount of data, it shows the potential of self-supervised method.