论文标题

基于OCSAFPN的对象检测在带噪声的航空图像中

Object Detection based on OcSaFPN in Aerial Images with Noise

论文作者

Li, Chengyuan, Liu, Jun, Hong, Hailong, Mao, Wenju, Wang, Chenjie, Hu, Chudi, Su, Xin, Luo, Bin

论文摘要

考虑基于深度学习的算法已成为提高空中图像中对象检测性能的关键方法。尽管已经开发了各种神经网络表示,但以前的工作仍然效率低下,无法研究降噪性能,尤其是在带有远摄镜头的摄像机拍摄的噪声的航空图像上,大多数研究集中在DeNoising领域。当然,Denoising通常需要额外的计算负担来获得更高质量的图像,而噪声弹性更多地描述了网络本身对不同噪声的鲁棒性,这是算法本身的属性。因此,这项工作将开始通过分析神经网络的噪声性能,然后提出两个假设以构建降噪结构。基于这些假设,我们比较了OCT-RESNET与频划分处理和常用的重新NET的噪声能力。此外,用于航空对象检测任务的先前特征金字塔网络不是专门为OCT-Resnet的频分部特征图设计的,并且通常不关注从不同深度的不同特征图之间弥合语义差距。在此基础上,提出了一种新型的基于八度卷积的语义注意特征金字塔网络(OCSAFPN),以便通过噪声在对象检测中获得更高的准确性。在三个数据集上测试的提出的算法表明,提出的OCSAFPN通过高斯噪声或乘法噪声实现了最新的检测性能。此外,更多的实验证明,可以轻松地将OCSAFPN结构添加到现有算法中,并且可以有效提高降噪能力。

Taking the deep learning-based algorithms into account has become a crucial way to boost object detection performance in aerial images. While various neural network representations have been developed, previous works are still inefficient to investigate the noise-resilient performance, especially on aerial images with noise taken by the cameras with telephoto lenses, and most of the research is concentrated in the field of denoising. Of course, denoising usually requires an additional computational burden to obtain higher quality images, while noise-resilient is more of a description of the robustness of the network itself to different noises, which is an attribute of the algorithm itself. For this reason, the work will be started by analyzing the noise-resilient performance of the neural network, and then propose two hypotheses to build a noise-resilient structure. Based on these hypotheses, we compare the noise-resilient ability of the Oct-ResNet with frequency division processing and the commonly used ResNet. In addition, previous feature pyramid networks used for aerial object detection tasks are not specifically designed for the frequency division feature maps of the Oct-ResNet, and they usually lack attention to bridging the semantic gap between diverse feature maps from different depths. On the basis of this, a novel octave convolution-based semantic attention feature pyramid network (OcSaFPN) is proposed to get higher accuracy in object detection with noise. The proposed algorithm tested on three datasets demonstrates that the proposed OcSaFPN achieves a state-of-the-art detection performance with Gaussian noise or multiplicative noise. In addition, more experiments have proved that the OcSaFPN structure can be easily added to existing algorithms, and the noise-resilient ability can be effectively improved.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源