论文标题
遥感图像场景分类在有限的标签样品下,用自我监督的范式进行
Remote Sensing Image Scene Classification with Self-Supervised Paradigm under Limited Labeled Samples
论文作者
论文摘要
随着深度学习的发展,监督学习方法在遥感图像(RSIS)场景分类中表现良好。但是,监督学习需要大量的注释数据进行培训。当标记的样品不足时,最常见的解决方案是使用大型自然图像数据集(例如Imagenet)微调训练模型。但是,这种学习范式不是灵丹妙药,尤其是当目标遥感图像(例如多光谱和高光谱数据)具有与RGB自然图像不同的成像机制时。为了解决这个问题,我们引入了新的自我监督学习(SSL)机制,以从大型未标记数据中获得RSIS场景分类的高性能预训练模型。在三个常用的RSIS场景分类数据集上进行的实验表明,这种新的学习范式的表现优于传统的主要成像网预培训模型。此外,我们分析了SSL中几个因素对RSIS场景分类任务的影响,包括选择自我监管信号,源和目标数据集之间的域差异以及预训练数据的量。从我们的研究中提取的见解可以帮助促进遥感社区中SSL的发展。由于SSL可以从非常容易获得的未标记的大量RSI中学习,因此这将是减轻对标签样品的依赖并因此有效地解决许多问题的潜在有希望的方法,例如全球映射。
With the development of deep learning, supervised learning methods perform well in remote sensing images (RSIs) scene classification. However, supervised learning requires a huge number of annotated data for training. When labeled samples are not sufficient, the most common solution is to fine-tune the pre-training models using a large natural image dataset (e.g. ImageNet). However, this learning paradigm is not a panacea, especially when the target remote sensing images (e.g. multispectral and hyperspectral data) have different imaging mechanisms from RGB natural images. To solve this problem, we introduce new self-supervised learning (SSL) mechanism to obtain the high-performance pre-training model for RSIs scene classification from large unlabeled data. Experiments on three commonly used RSIs scene classification datasets demonstrated that this new learning paradigm outperforms the traditional dominant ImageNet pre-trained model. Moreover, we analyze the impacts of several factors in SSL on RSIs scene classification tasks, including the choice of self-supervised signals, the domain difference between the source and target dataset, and the amount of pre-training data. The insights distilled from our studies can help to foster the development of SSL in the remote sensing community. Since SSL could learn from unlabeled massive RSIs which are extremely easy to obtain, it will be a potentially promising way to alleviate dependence on labeled samples and thus efficiently solve many problems, such as global mapping.