端到端实例边缘检测

论文标题

端到端实例边缘检测

End-to-End Instance Edge Detection

论文作者

Zou, Xueyan, Liu, Haotian, Lee, Yong Jae

论文摘要

长期以来，边缘检测一直是计算机视觉领域的重要问题。先前的工作探索了类别不合时宜的或类别感知的边缘检测。在本文中，我们在对象实例的上下文中探索边缘检测。尽管对象边界可以轻松地从分割掩码中得出，但实际上，实例分割模型经过训练以最大化地面真相掩码，这意味着分割边界不被强制执行以与地面环境边界确切保持一致。因此，实例边缘检测本身的任务是不同且至关重要的。由于精确的边缘检测需要高分辨率的特征图，因此我们设计了一种新型的变压器体系结构，该架构有效地结合了FPN和变压器解码器，以在合理的计算预算中互相关注多尺度高分辨率特征图。此外，我们提出了一个适用于实例边缘和掩模检测的轻质密集预测头。最后，我们使用惩罚减少的焦点损失来通过实例边缘有效地训练模型，这可以降低注释成本。与最新的基线相比，我们证明了高度竞争的实例边缘检测性能，还表明所提出的任务和损失与实例分割和对象检测是互补的。

Edge detection has long been an important problem in the field of computer vision. Previous works have explored category-agnostic or category-aware edge detection. In this paper, we explore edge detection in the context of object instances. Although object boundaries could be easily derived from segmentation masks, in practice, instance segmentation models are trained to maximize IoU to the ground-truth mask, which means that segmentation boundaries are not enforced to precisely align with ground-truth edge boundaries. Thus, the task of instance edge detection itself is different and critical. Since precise edge detection requires high resolution feature maps, we design a novel transformer architecture that efficiently combines a FPN and a transformer decoder to enable cross attention on multi-scale high resolution feature maps within a reasonable computation budget. Further, we propose a light weight dense prediction head that is applicable to both instance edge and mask detection. Finally, we use a penalty reduced focal loss to effectively train the model with point supervision on instance edges, which can reduce annotation costs. We demonstrate highly competitive instance edge detection performance compared to state-of-the-art baselines, and also show that the proposed task and loss are complementary to instance segmentation and object detection.

下载PDF全文

下载文献需遵守相关版权规定

论文标题