论文标题
基于群集的对比度解开,用于广义零射门学习
Cluster-based Contrastive Disentangling for Generalized Zero-Shot Learning
论文作者
论文摘要
广义的零射击学习(GZSL)旨在仅训练看到的班级,旨在识别所见类别和看不见的阶级,在这种阶级中,看不见的阶级往往会偏向于所见类别。在本文中,我们提出了一种基于群集的对比度解开(CCD)方法,以减轻语义差距和域转移问题来改善GZSL。具体来说,我们首先将批处数据聚集,以形成几个包含相似类别的集合。然后,我们将视觉特征分解为语义不合格和语义匹配的变量,并根据群集结果进一步将语义匹配的变量分为类共享和类唯一的变量。带有随机交换和语义 - 视觉对齐的分离学习模块桥接语义差距。此外,我们在语义匹配和类唯一的变量上介绍了对比度学习,以学习高度的内部和阶层内相似性,以及类间和类间的可区分性。然后,生成的视觉特征符合一般图像的基本特征,并具有强大的歧视性信息,从而可以很好地减轻域移动问题。我们在四个数据集上评估了我们提出的方法,并在常规和广义设置中获得最先进的结果。
Generalized Zero-Shot Learning (GZSL) aims to recognize both seen and unseen classes by training only the seen classes, in which the instances of unseen classes tend to be biased towards the seen class. In this paper, we propose a Cluster-based Contrastive Disentangling (CCD) method to improve GZSL by alleviating the semantic gap and domain shift problems. Specifically, we first cluster the batch data to form several sets containing similar classes. Then, we disentangle the visual features into semantic-unspecific and semantic-matched variables, and further disentangle the semantic-matched variables into class-shared and class-unique variables according to the clustering results. The disentangled learning module with random swapping and semantic-visual alignment bridges the semantic gap. Moreover, we introduce contrastive learning on semantic-matched and class-unique variables to learn high intra-set and intra-class similarity, as well as inter-set and inter-class discriminability. Then, the generated visual features conform to the underlying characteristics of general images and have strong discriminative information, which alleviates the domain shift problem well. We evaluate our proposed method on four datasets and achieve state-of-the-art results in both conventional and generalized settings.