论文标题
Fathomnet:海洋探索与发现的水下图像培训数据库
FathomNet: An underwater image training database for ocean exploration and discovery
论文作者
论文摘要
每年从远程操作的车辆(ROV)和其他水下资产中收集数千个小时的海洋视频数据。但是,当前的手动分析方法阻碍了对ROV和大型生物多样性分析的实时算法的全面利用。 Fathomnet是一种新颖的基线图像训练集,优化以加速对水下图像的现代,智能和自动化分析的发展。我们的种子数据集包括一个经过熟悉的注释和不断维护的数据库,其中包含超过26,000个小时的录像带,680万个注释和4,349个学期。 Fathomnet通过提供水下概念的图像,本地化和类标签来利用此数据集,以便使机器学习算法开发。迄今为止,有80,000多个图像和106,000个本地化,包括233种不同的类别,包括中间和底栖生物。我们的实验包括培训各种深度学习算法的方法,该算法通过解决弱监督的本地化,图像标记,对象检测和分类的方法组成,这些方法被证明是有希望的。虽然我们找到了有关此新数据集预测的质量结果,但我们的结果表明,我们最终需要更大的数据集用于海洋探索。
Thousands of hours of marine video data are collected annually from remotely operated vehicles (ROVs) and other underwater assets. However, current manual methods of analysis impede the full utilization of collected data for real time algorithms for ROV and large biodiversity analyses. FathomNet is a novel baseline image training set, optimized to accelerate development of modern, intelligent, and automated analysis of underwater imagery. Our seed data set consists of an expertly annotated and continuously maintained database with more than 26,000 hours of videotape, 6.8 million annotations, and 4,349 terms in the knowledge base. FathomNet leverages this data set by providing imagery, localizations, and class labels of underwater concepts in order to enable machine learning algorithm development. To date, there are more than 80,000 images and 106,000 localizations for 233 different classes, including midwater and benthic organisms. Our experiments consisted of training various deep learning algorithms with approaches to address weakly supervised localization, image labeling, object detection and classification which prove to be promising. While we find quality results on prediction for this new dataset, our results indicate that we are ultimately in need of a larger data set for ocean exploration.