论文标题
EXPNET:专家级分类的统一网络
ExpNet: A unified network for Expert-Level Classification
论文作者
论文摘要
与一般视觉分类不同,某些分类任务更具挑战性,因为它们需要图像的专业类别。在论文中,我们称它们为专家级分类。以前的细粒视力分类(FGVC)已在其某些特定子任务上做出了许多努力。但是,它们很难扩展到依赖于全球相关性和分层特征相互作用的全面分析的一般情况。在本文中,我们建议专家网络(EXPNET)通过统一网络解决专家级分类的独特挑战。在expnet中,我们从层次结构分离零件和上下文特征,并使用一种新颖的细心机制(称为凝视移动)分别处理它们。在每个阶段,视线转移都会为随后的抽象产生焦点零件特征,并记住与上下文相关的嵌入。然后,我们将最终焦点嵌入与所有记忆的上下文相关嵌入以进行预测融合。这样的体系结构实现了部分和全局信息以及分层特征交互的双轨处理。我们对三个代表性的专家级分类任务进行实验:FGVC,疾病分类和艺术品属性分类。在这些实验中,观察到我们的expnet的卓越性能与在各个领域的最新面积相比,表明我们的expnet的有效性和概括。该代码将公开可用。
Different from the general visual classification, some classification tasks are more challenging as they need the professional categories of the images. In the paper, we call them expert-level classification. Previous fine-grained vision classification (FGVC) has made many efforts on some of its specific sub-tasks. However, they are difficult to expand to the general cases which rely on the comprehensive analysis of part-global correlation and the hierarchical features interaction. In this paper, we propose Expert Network (ExpNet) to address the unique challenges of expert-level classification through a unified network. In ExpNet, we hierarchically decouple the part and context features and individually process them using a novel attentive mechanism, called Gaze-Shift. In each stage, Gaze-Shift produces a focal-part feature for the subsequent abstraction and memorizes a context-related embedding. Then we fuse the final focal embedding with all memorized context-related embedding to make the prediction. Such an architecture realizes the dual-track processing of partial and global information and hierarchical feature interactions. We conduct the experiments over three representative expert-level classification tasks: FGVC, disease classification, and artwork attributes classification. In these experiments, superior performance of our ExpNet is observed comparing to the state-of-the-arts in a wide range of fields, indicating the effectiveness and generalization of our ExpNet. The code will be made publicly available.