Paper title
A crowdsourced dataset of aerial images with annotated solar photovoltaic arrays and installation metadata
Paper authors
Paper abstract
Photovoltaic (PV) energy generation plays a crucial role in the energy transition. Small-scale PV installations are deployed at an unprecedented pace, and their integration into the grid can be challenging since public authorities often lack quality data about them. Overhead imagery is increasingly used to improve the knowledge of residential PV installations with machine learning models capable of automatically mapping these installations. However, these models cannot be easily transferred from one region or data source to another due to differences in image acquisition. To address this issue, known as domain shift, and foster the development of PV array mapping pipelines, we propose a dataset containing aerial images, annotations, and segmentation masks. We provide installation metadata for more than 28,000 installations. We provide ground truth segmentation masks for 13,000 installations, including 7,000 with annotations for two different image providers. Finally, we provide installation metadata that matches the annotations for more than 8,000 installations. Dataset applications include end-to-end PV registry construction, robust PV installation mapping, and analysis of crowdsourced datasets.