论文标题
了解分布外检测的对比度学习的属性和局限性
Understanding the properties and limitations of contrastive learning for Out-of-Distribution detection
论文作者
论文摘要
最新流行的分发方法(OOD)检测方法是基于一种被称为对比度学习的自学学习技术。对比度学习有两个主要变体,即实例和阶级歧视,定位可以区分前者不同实例的特征,而后者的不同类别。 在本文中,我们旨在了解OOD检测的现有对比学习方法的有效性和局限性。我们以三种方式对此进行处理。首先,我们系统地研究了实例歧视与受监督的对比度学习变体之间的性能差异。其次,我们研究哪些分布(ID)类OOD数据倾向于将其分类为分类。最后,我们研究了不同对比度学习方法的光谱衰减特性,并研究了它与OOD检测性能的相关性。在ID和OOD数据集彼此之间完全不同的情况下,我们看到实例歧视在没有微调的情况下与OOD检测中的有监督方法具有竞争力。我们看到,OOD样品倾向于将其分类为具有类似于整个数据集的分布的类。此外,我们表明对比度学习学习了一个特征空间,其中包含具有较高方差的多个方向的奇异向量,根据所使用的推理方法,这可能有害或有益于OOD检测。
A recent popular approach to out-of-distribution (OOD) detection is based on a self-supervised learning technique referred to as contrastive learning. There are two main variants of contrastive learning, namely instance and class discrimination, targeting features that can discriminate between different instances for the former, and different classes for the latter. In this paper, we aim to understand the effectiveness and limitation of existing contrastive learning methods for OOD detection. We approach this in 3 ways. First, we systematically study the performance difference between the instance discrimination and supervised contrastive learning variants in different OOD detection settings. Second, we study which in-distribution (ID) classes OOD data tend to be classified into. Finally, we study the spectral decay property of the different contrastive learning approaches and examine how it correlates with OOD detection performance. In scenarios where the ID and OOD datasets are sufficiently different from one another, we see that instance discrimination, in the absence of fine-tuning, is competitive with supervised approaches in OOD detection. We see that OOD samples tend to be classified into classes that have a distribution similar to the distribution of the entire dataset. Furthermore, we show that contrastive learning learns a feature space that contains singular vectors containing several directions with a high variance which can be detrimental or beneficial to OOD detection depending on the inference approach used.