论文标题

苏打水:用于施工深度学习的站点对象检测数据集

SODA: Site Object Detection dAtaset for Deep Learning in Construction

论文作者

Duan, Rui, Deng, Hui, Tian, Mao, Deng, Yichuan, Lin, Jiarui

论文摘要

基于计算机的深度学习对象检测算法已经开发出足够强大的功能,以支持识别各种对象的能力。尽管目前有用于对象检测的一般数据集,但仍缺乏用于建筑行业的大规模开源数据集,这限制了对象检测算法的发展,因为它们往往是备数据的。因此,本文开发了一个专门收集和注释的新的大规模图像数据集,称为站点对象检测数据集(SODA),其中包含15种由工人,材料,机器和布局分类的对象类。首先,在不同地点条件,天气条件和施工阶段的多个施工地点收集了20,000多张图像,这些图像涵盖了不同的角度和视角。经过仔细的筛选和处理后,获得了19,846张图像,包括286,201个对象,并根据预定义的类别对标签进行注释。统计分析表明,开发的数据集在多样性和数量方面是有利的。基于深度学习(Yolo V3/ Yolo V4)的两种广泛的对象检测算法的进一步评估也说明了数据集对典型构造方案的可行性,最大地图达到81.47%。通过这种方式,这项研究为建筑行业中基于深度学习的对象检测方法的开发提供了一个大规模的图像数据集,并为进一步评估该领域的相应算法建立了性能基准。

Computer vision-based deep learning object detection algorithms have been developed sufficiently powerful to support the ability to recognize various objects. Although there are currently general datasets for object detection, there is still a lack of large-scale, open-source dataset for the construction industry, which limits the developments of object detection algorithms as they tend to be data-hungry. Therefore, this paper develops a new large-scale image dataset specifically collected and annotated for the construction site, called Site Object Detection dAtaset (SODA), which contains 15 kinds of object classes categorized by workers, materials, machines, and layout. Firstly, more than 20,000 images were collected from multiple construction sites in different site conditions, weather conditions, and construction phases, which covered different angles and perspectives. After careful screening and processing, 19,846 images including 286,201 objects were then obtained and annotated with labels in accordance with predefined categories. Statistical analysis shows that the developed dataset is advantageous in terms of diversity and volume. Further evaluation with two widely-adopted object detection algorithms based on deep learning (YOLO v3/ YOLO v4) also illustrates the feasibility of the dataset for typical construction scenarios, achieving a maximum mAP of 81.47%. In this manner, this research contributes a large-scale image dataset for the development of deep learning-based object detection methods in the construction industry and sets up a performance benchmark for further evaluation of corresponding algorithms in this area.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源