论文标题

成长与合并:连续类别发现的统一框架

Grow and Merge: A Unified Framework for Continuous Categories Discovery

论文作者

Zhang, Xinwei, Jiang, Jianwen, Feng, Yutong, Wu, Zhi-Fan, Zhao, Xibin, Wan, Hai, Tang, Mingqian, Jin, Rong, Gao, Yue

论文摘要

尽管许多研究专门用于新的类别发现,但其中大多数都假定一个静态设置,其中标记和未标记的数据立即给出以寻找新类别。在这项工作中,我们专注于应用程序方案,其中未标记的数据被连续馈送到类别发现系统中。我们将其称为{\ bf连续类别发现}({\ bf ccd})问题,它比静态设置更具挑战性。新颖类别发现面临的一个常见挑战是,分类和类别发现需要不同的功能集:类别歧视性特征是分类的优选,而丰富而多样的功能则更适合新类别采矿。随着系统的要求,随着时间的推移,该系统被要求提供良好的性能,同时不断从未标记的数据中发现新类,因此该挑战变得更加严重。为了应对这一挑战,我们开发了一个{\ bf增长和合并}({\ bf gm})的框架,该框架是通过在增长阶段和合并阶段交替交替工作的:在增长阶段,它通过连续的自学学习阶段增加了特征的多样性,以确保与一个阶段相满足,以确保与一个阶段保持一致的阶段,以确保与一个既有的模型保持一致的效果。我们的广泛研究证实,所提出的GM框架比连续类别发现的最新方法要有效得多。

Although a number of studies are devoted to novel category discovery, most of them assume a static setting where both labeled and unlabeled data are given at once for finding new categories. In this work, we focus on the application scenarios where unlabeled data are continuously fed into the category discovery system. We refer to it as the {\bf Continuous Category Discovery} ({\bf CCD}) problem, which is significantly more challenging than the static setting. A common challenge faced by novel category discovery is that different sets of features are needed for classification and category discovery: class discriminative features are preferred for classification, while rich and diverse features are more suitable for new category mining. This challenge becomes more severe for dynamic setting as the system is asked to deliver good performance for known classes over time, and at the same time continuously discover new classes from unlabeled data. To address this challenge, we develop a framework of {\bf Grow and Merge} ({\bf GM}) that works by alternating between a growing phase and a merging phase: in the growing phase, it increases the diversity of features through a continuous self-supervised learning for effective category mining, and in the merging phase, it merges the grown model with a static one to ensure satisfying performance for known classes. Our extensive studies verify that the proposed GM framework is significantly more effective than the state-of-the-art approaches for continuous category discovery.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源