图像和视频域的新面部交换方法：技术报告

论文标题

图像和视频域的新面部交换方法：技术报告

A new face swap method for image and video domains: a technical report

论文作者

Chesakov, Daniil, Maltseva, Anastasia, Groshev, Alexander, Kuznetsov, Andrey, Dimitrov, Denis

论文摘要

在过去的几年中，深层假技术成为了一个热门研究领域。研究人员调查了复杂的生成对抗网络（GAN），自动编码器以及其他方法来建立面部交换的精确且健壮的算法。实现的结果表明，在生成的数据的视觉质量方面，深度假的无监督合成任务存在问题。当专家分析时，这些问题通常会导致较高的虚假检测准确性。第一个问题是现有的图像到图像方法不考虑视频域的特异性和逐帧处理会导致面临抖动和其他清晰可见的扭曲。另一个问题是生成的数据分辨率，由于较高的计算复杂性，许多现有方法较低。当源面的比例较大时（例如较大的脸颊）时，出现第三个问题，在更换后，它在面部边框上变得可见。我们的主要目标是开发一种可以解决这些问题并在许多线索指标上胜过现有解决方案的方法。我们引入了基于面部架构的新面部交换管道，并解决了上述问题。随着新的眼睛损失功能，超分辨率块和基于高斯的面罩的产生，可以改善质量，这在评估过程中得到了证实。

Deep fake technology became a hot field of research in the last few years. Researchers investigate sophisticated Generative Adversarial Networks (GAN), autoencoders, and other approaches to establish precise and robust algorithms for face swapping. Achieved results show that the deep fake unsupervised synthesis task has problems in terms of the visual quality of generated data. These problems usually lead to high fake detection accuracy when an expert analyzes them. The first problem is that existing image-to-image approaches do not consider video domain specificity and frame-by-frame processing leads to face jittering and other clearly visible distortions. Another problem is the generated data resolution, which is low for many existing methods due to high computational complexity. The third problem appears when the source face has larger proportions (like bigger cheeks), and after replacement it becomes visible on the face border. Our main goal was to develop such an approach that could solve these problems and outperform existing solutions on a number of clue metrics. We introduce a new face swap pipeline that is based on FaceShifter architecture and fixes the problems stated above. With a new eye loss function, super-resolution block, and Gaussian-based face mask generation leads to improvements in quality which is confirmed during evaluation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题