论文标题
葡萄酒:小波引导的gan倒置和编辑高保真精致
WINE: Wavelet-Guided GAN Inversion and Editing for High-Fidelity Refinement
论文作者
论文摘要
最近的高级GAN反转模型旨在通过使用发电机调整或高维功能学习的方法传达从原始图像到发电机的高保真信息。尽管做出了这些努力,但由于训练和结构方面的固有局限性,准确地重建特定图像的细节仍然是一个挑战,从而导致对低频信息的偏见。在本文中,我们研究了GAN倒置中广泛使用的像素损失,揭示了其主要关注低频特征的重建。然后,我们提出了葡萄酒(一种小波引导的GAN倒置和编辑模型),通过小波系数通过新提出的小波损耗和小波融合方案传输高频信息。值得注意的是,葡萄酒是解释频域中GAN反转的首次尝试。我们的实验结果展示了葡萄酒在保留高频细节和增强图像质量方面的精度。即使在编辑方案中,葡萄酒的表现都优于现有的最先进的倒置模型,在编辑性和重建质量之间具有良好的平衡。
Recent advanced GAN inversion models aim to convey high-fidelity information from original images to generators through methods using generator tuning or high-dimensional feature learning. Despite these efforts, accurately reconstructing image-specific details remains as a challenge due to the inherent limitations both in terms of training and structural aspects, leading to a bias towards low-frequency information. In this paper, we look into the widely used pixel loss in GAN inversion, revealing its predominant focus on the reconstruction of low-frequency features. We then propose WINE, a Wavelet-guided GAN Inversion aNd Editing model, which transfers the high-frequency information through wavelet coefficients via newly proposed wavelet loss and wavelet fusion scheme. Notably, WINE is the first attempt to interpret GAN inversion in the frequency domain. Our experimental results showcase the precision of WINE in preserving high-frequency details and enhancing image quality. Even in editing scenarios, WINE outperforms existing state-of-the-art GAN inversion models with a fine balance between editability and reconstruction quality.