Paper title

Neural arbitrary style transfer for portrait images using the attention mechanism

Authors

Berezin, S. A., Volkova, V. M.

Abstract

Arbitrary style transfer is the task of synthesizing a previously unseen image from two given images: a content image and a style image. The content image determines the structure, that is, the basic geometric lines and shapes of the resulting image, while the style image sets the color and texture of the result. The word "arbitrary" in this context means the absence of any single pre-learned style. For example, convolutional neural networks that can transfer a new style only after training or retraining on new data are not considered to solve this problem, whereas attention-based networks that can perform such a transformation without retraining are. The original image can be, for example, a photograph, and the style image a painting by a famous artist. The resulting image is then the scene depicted in the original photograph, rendered in the style of that painting. Recent arbitrary style transfer algorithms achieve good results on this task; however, when processing portrait images of people, their output is either unacceptable because of excessive distortion of facial features, or weakly expressed, lacking the characteristic features of the style image. In this paper, we consider an approach to solving this problem using a combined architecture of deep neural networks with an attention mechanism that transfers style based on the contents of a particular image segment: with a clear predominance of style over form for the background part of the image, and with a prevalence of content over form in the part of the image that directly contains the person.
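
The abstract does not say how the segment-dependent transfer is implemented. As a rough illustration of the idea only (style dominating in the background, content preserved in the person region), the following minimal sketch blends an already stylized image with its content image using a per-pixel style weight derived from a person mask. The function name `blend_by_region`, its parameters, and the default weights are hypothetical and are not taken from the paper, which describes a neural architecture rather than a pixel-space blend.

```python
import numpy as np


def blend_by_region(content, stylized, person_mask,
                    style_weight_person=0.3, style_weight_background=0.9):
    """Blend a stylized image with its content image, region by region.

    content, stylized: float arrays of shape (H, W, 3), values in [0, 1].
    person_mask: float array of shape (H, W); 1.0 on the person, 0.0 elsewhere.
    The two weights are illustrative assumptions, not values from the paper.
    """
    # Add a channel axis so the mask broadcasts over the RGB channels.
    mask = person_mask[..., None]
    # Per-pixel style strength: low on the person, high on the background.
    style_weight = (mask * style_weight_person
                    + (1.0 - mask) * style_weight_background)
    # Convex combination of the stylized and the original content pixels.
    return style_weight * stylized + (1.0 - style_weight) * content


# Toy usage: a 2x2 image where only the top-left pixel belongs to the person.
content = np.zeros((2, 2, 3))
stylized = np.ones((2, 2, 3))
person_mask = np.array([[1.0, 0.0],
                        [0.0, 0.0]])
result = blend_by_region(content, stylized, person_mask)
```

In practice the mask would come from a separate portrait segmentation step, and a soft mask (values between 0 and 1) would give a smooth transition at the person's outline instead of a hard edge.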
