论文标题
如何利用学习图像压缩到常规编解码器的可传递性
How to Exploit the Transferability of Learned Image Compression to Conventional Codecs
论文作者
论文摘要
损耗的图像压缩通常受到所选损失度量的简单性的限制。最近的研究表明,生成的对抗网络具有克服这一局限性并成为多模式损失的能力,尤其是对于纹理而言。与学习的图像压缩一起,这两种技术可以在放松常用的严格失真措施时可以极大地效果。但是,基于卷积神经网络的算法具有较大的计算足迹。理想情况下,现有的常规编解码器应留在原地,这将确保更快地采用并遵守平衡的计算信封。 作为实现这一目标的可能途径,在这项工作中,我们提出并调查了如何将学习的图像编码用作替代物来优化编码图像。图像通过学习的过滤器更改,以优化针对不同的性能度量或特定任务。通过生成的对抗网络扩展了这个想法,我们展示了整个纹理如何被编码较低但保留细节感的成本较低的纹理所取代。 我们的方法可以重塑常规编解码器,以调整MS-SSSIM失真,并提高率超过20%,而无需任何解码开销。在任务感知的图像压缩上,我们对类似但特定于编解码的方法的表现良好。
Lossy image compression is often limited by the simplicity of the chosen loss measure. Recent research suggests that generative adversarial networks have the ability to overcome this limitation and serve as a multi-modal loss, especially for textures. Together with learned image compression, these two techniques can be used to great effect when relaxing the commonly employed tight measures of distortion. However, convolutional neural network based algorithms have a large computational footprint. Ideally, an existing conventional codec should stay in place, which would ensure faster adoption and adhering to a balanced computational envelope. As a possible avenue to this goal, in this work, we propose and investigate how learned image coding can be used as a surrogate to optimize an image for encoding. The image is altered by a learned filter to optimise for a different performance measure or a particular task. Extending this idea with a generative adversarial network, we show how entire textures are replaced by ones that are less costly to encode but preserve sense of detail. Our approach can remodel a conventional codec to adjust for the MS-SSIM distortion with over 20% rate improvement without any decoding overhead. On task-aware image compression, we perform favourably against a similar but codec-specific approach.