Sofgan：具有动态样式的肖像图像生成器

论文标题

Sofgan：具有动态样式的肖像图像生成器

SofGAN: A Portrait Image Generator with Dynamic Styling

论文作者

Chen, Anpei, Liu, Ruiyang, Xie, Ling, Chen, Zhang, Su, Hao, Yu, Jingyi

论文摘要

最近，生成的对抗网络（GAN）}已被广泛用于肖像图像生成。但是，在甘恩学到的潜在空间中，通常纠缠着不同的属性，例如姿势，形状和纹理样式，使得对特定属性的明确控制变得困难。为了解决这个问题，我们提出了一个Sofgan图像生成器，将肖像的潜在空间分解为两个子空间：几何空间和一个纹理空间。从两个子空间采样的潜在代码分别馈送到两个网络分支，一个用于生成带有规范姿势的肖像的3D几何形状，另一个用于生成纹理。对齐的3D几何形状也带有语义部分分割，编码为语义占用场（SOF）。 SOF允许在任意视图上渲染一致的2D语义分割图，然后将其与生成的纹理图融合，并使用我们的语义实例（SIW）模块进行风格化为肖像照片。通过广泛的实验，我们表明我们的系统可以生成具有独立控制的几何形状和纹理属性的高质量肖像图像。该方法还可以很好地概括在各种应用程序中，例如外观一致的面部动画和动态样式。

Recently, Generative Adversarial Networks (GANs)} have been widely used for portrait image generation. However, in the latent space learned by GANs, different attributes, such as pose, shape, and texture style, are generally entangled, making the explicit control of specific attributes difficult. To address this issue, we propose a SofGAN image generator to decouple the latent space of portraits into two subspaces: a geometry space and a texture space. The latent codes sampled from the two subspaces are fed to two network branches separately, one to generate the 3D geometry of portraits with canonical pose, and the other to generate textures. The aligned 3D geometries also come with semantic part segmentation, encoded as a semantic occupancy field (SOF). The SOF allows the rendering of consistent 2D semantic segmentation maps at arbitrary views, which are then fused with the generated texture maps and stylized to a portrait photo using our semantic instance-wise (SIW) module. Through extensive experiments, we show that our system can generate high quality portrait images with independently controllable geometry and texture attributes. The method also generalizes well in various applications such as appearance-consistent facial animation and dynamic styling.

下载PDF全文

下载文献需遵守相关版权规定

论文标题