Swinfir：通过快速的傅立叶卷积和改进的图像超分辨率训练，重新访问Swinir

论文标题

Swinfir：通过快速的傅立叶卷积和改进的图像超分辨率训练，重新访问Swinir

SwinFIR: Revisiting the SwinIR with Fast Fourier Convolution and Improved Training for Image Super-Resolution

论文作者

Zhang, Dafeng, Huang, Feiyu, Liu, Shizhuo, Wang, Xiaobing, Jin, Zhezhu

论文摘要

基于变压器的方法与基于CNN的方法相比，由于其对长期依赖性的模型，因此获得了令人印象深刻的图像恢复性能。但是，像Swinir这样的进步采用了基于窗口的和本地关注策略来平衡性能和计算开销，这限制了采用大型接收领域来捕获全球信息并在早期层中建立长期依赖性。为了进一步提高捕获全球信息的效率，在这项工作中，我们建议Swinfir通过替换具有具有整个图像范围接收场的快速傅立叶卷积（FFC）组件来扩展Swinir。我们还重新访问其他先进技术，即数据增强，预训练和功能集合，以改善图像重建的效果。而我们的功能集合方法使模型的性能可以大大增强，而无需增加训练和测试时间。与现有方法相比，我们将算法应用于多个流行的大规模基准，并实现了最先进的性能。例如，我们的Swinfir在漫画109数据集上达到了32.83 dB的PSNR，该PSNR比最先进的Swinir方法高0.8 dB。

Transformer-based methods have achieved impressive image restoration performance due to their capacities to model long-range dependency compared to CNN-based methods. However, advances like SwinIR adopts the window-based and local attention strategy to balance the performance and computational overhead, which restricts employing large receptive fields to capture global information and establish long dependencies in the early layers. To further improve the efficiency of capturing global information, in this work, we propose SwinFIR to extend SwinIR by replacing Fast Fourier Convolution (FFC) components, which have the image-wide receptive field. We also revisit other advanced techniques, i.e, data augmentation, pre-training, and feature ensemble to improve the effect of image reconstruction. And our feature ensemble method enables the performance of the model to be considerably enhanced without increasing the training and testing time. We applied our algorithm on multiple popular large-scale benchmarks and achieved state-of-the-art performance comparing to the existing methods. For example, our SwinFIR achieves the PSNR of 32.83 dB on Manga109 dataset, which is 0.8 dB higher than the state-of-the-art SwinIR method.

下载PDF全文

下载文献需遵守相关版权规定

论文标题