不平衡的稳健软效果用于深层学习

论文标题

不平衡的稳健软效果用于深层学习

Imbalance Robust Softmax for Deep Embeeding Learning

论文作者

Zhu, Hao, Yuan, Yang, Hu, Guosheng, Wu, Xiang, Robertson, Neil

论文摘要

深入嵌入学习有望学习一个度量空间，在该度量空间中，该特征的最大课内距离比最小的课堂间距离更小。近年来，一种研究重点是通过在面部识别（FR）和人重新识别（RE-ID）领域中歧视性深层学习来解决开放式问题。除了开放设定的问题外，我们发现培训数据不平衡是导致FR和RE-ID性能降低的另一个主要因素，并且在真实应用中广泛存在数据不平衡。但是，很少的研究探讨了数据不平衡如何通过SoftMax或其变体影响FR和RE-ID的性能。在这项工作中，我们从神经网络优化和有关软磁性的特征分布的角度深入研究数据不平衡。我们发现由数据不平衡引起的性能降解的一个主要原因是，权重（从倒数第二个完全连接的层）远非其特征空间中的类中心。基于这项调查，我们提出了一个统一的框架，不平衡的软马克斯（IR-Softmax），该框架可以同时解决开放集问题并减少数据不平衡的影响。 IR-SoftMax可以通过将权重作为班级中心直接设置为自然解决数据不平衡问题，从而将其推广到任何SoftMax及其变体（对于开放设定问题而言是歧视性）。在这项工作中，我们在IR-Softmax框架下明确重新构建了两个判别性软智能（A-Suffmax和Am-Softmax）。我们在FR数据库（LFW，Megaface）和Re-ID数据库（Market-1501，Duke）和IR-Softmax上进行了广泛的实验，并且优于许多最先进的方法。

Deep embedding learning is expected to learn a metric space in which features have smaller maximal intra-class distance than minimal inter-class distance. In recent years, one research focus is to solve the open-set problem by discriminative deep embedding learning in the field of face recognition (FR) and person re-identification (re-ID). Apart from open-set problem, we find that imbalanced training data is another main factor causing the performance degradation of FR and re-ID, and data imbalance widely exists in the real applications. However, very little research explores why and how data imbalance influences the performance of FR and re-ID with softmax or its variants. In this work, we deeply investigate data imbalance in the perspective of neural network optimisation and feature distribution about softmax. We find one main reason of performance degradation caused by data imbalance is that the weights (from the penultimate fully-connected layer) are far from their class centers in feature space. Based on this investigation, we propose a unified framework, Imbalance-Robust Softmax (IR-Softmax), which can simultaneously solve the open-set problem and reduce the influence of data imbalance. IR-Softmax can generalise to any softmax and its variants (which are discriminative for open-set problem) by directly setting the weights as their class centers, naturally solving the data imbalance problem. In this work, we explicitly re-formulate two discriminative softmax (A-Softmax and AM-Softmax) under the framework of IR-Softmax. We conduct extensive experiments on FR databases (LFW, MegaFace) and re-ID database (Market-1501, Duke), and IR-Softmax outperforms many state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题