论文标题
开放式识别的混合模型
Hybrid Models for Open Set Recognition
论文作者
论文摘要
开放式识别需要分类器来检测不属于其培训集中任何类的样本。现有方法符合训练样本的概率分布在其嵌入空间上,并根据此分布检测异常值。嵌入空间通常是从歧视性分类器中获得的。但是,这种歧视性表示仅着眼于已知类别,这对于区分未知类别并不重要。我们认为,应从嵌入式分类器和密度估计器(用作离群值检测器)中共同学习表示空间。我们提出了由编码器组成的开hybrid框架,该框架将输入数据编码到关节嵌入空间中,将样品分类为Inlier类别,以及基于流的密度估计器,以检测样品是否属于未知类别。现有基于流的模型的一个典型问题是,它们可能会为异常值分配更高的可能性。但是,我们从经验上观察到,在学习歧视性和生成性成分的联合表示时,我们的实验中不会发生这样的问题。标准开放设置基准测试的实验还表明,端到端训练的OpenHybrid模型显着胜过最先进的方法和基于流动的基线。
Open set recognition requires a classifier to detect samples not belonging to any of the classes in its training set. Existing methods fit a probability distribution to the training samples on their embedding space and detect outliers according to this distribution. The embedding space is often obtained from a discriminative classifier. However, such discriminative representation focuses only on known classes, which may not be critical for distinguishing the unknown classes. We argue that the representation space should be jointly learned from the inlier classifier and the density estimator (served as an outlier detector). We propose the OpenHybrid framework, which is composed of an encoder to encode the input data into a joint embedding space, a classifier to classify samples to inlier classes, and a flow-based density estimator to detect whether a sample belongs to the unknown category. A typical problem of existing flow-based models is that they may assign a higher likelihood to outliers. However, we empirically observe that such an issue does not occur in our experiments when learning a joint representation for discriminative and generative components. Experiments on standard open set benchmarks also reveal that an end-to-end trained OpenHybrid model significantly outperforms state-of-the-art methods and flow-based baselines.