论文标题
谷物空间:一个大规模数据集,用于谷物谷物的细粒度和域自适应识别
GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains
论文作者
论文摘要
谷物是人类饮食的重要组成部分,是人们生计和国际贸易的重要商品。谷物外观检查(GAI)是确定谷物质量和谷物分层以进行适当循环,储存和食物加工等的关键步骤之一。 Automated Gai的好处是极大地协助检查员完成工作,但由于缺乏数据集和对任务的明确定义,因此受到限制。 在本文中,我们将GAI作为三个无处不在的计算机视觉任务:细粒度的识别,域的适应性和分布外识别。我们提出了一个称为Grainspace的大规模且公开的谷物数据集。具体而言,我们构建了三种类型的设备原型用于数据获取,总共有525万张由专业检查员确定的图像。从五个国家和30多个地区收集了包括小麦,玉米和大米在内的谷物样品。我们还基于半监督学习和自学学习技巧的全面基准。据我们所知,谷物空间是第一个公开发布的用于谷物谷物检查的数据集。
Cereal grains are a vital part of human diets and are important commodities for people's livelihood and international trade. Grain Appearance Inspection (GAI) serves as one of the crucial steps for the determination of grain quality and grain stratification for proper circulation, storage and food processing, etc. GAI is routinely performed manually by qualified inspectors with the aid of some hand tools. Automated GAI has the benefit of greatly assisting inspectors with their jobs but has been limited due to the lack of datasets and clear definitions of the tasks. In this paper we formulate GAI as three ubiquitous computer vision tasks: fine-grained recognition, domain adaptation and out-of-distribution recognition. We present a large-scale and publicly available cereal grains dataset called GrainSpace. Specifically, we construct three types of device prototypes for data acquisition, and a total of 5.25 million images determined by professional inspectors. The grain samples including wheat, maize and rice are collected from five countries and more than 30 regions. We also develop a comprehensive benchmark based on semi-supervised learning and self-supervised learning techniques. To the best of our knowledge, GrainSpace is the first publicly released dataset for cereal grain inspection.