论文标题
Strokegan+:用中风编码的几个射击半监督中国字体一代
StrokeGAN+: Few-Shot Semi-Supervised Chinese Font Generation with Stroke Encoding
论文作者
论文摘要
中国字体的一代具有广泛的应用。当前主要的方法主要基于深层生成模型,尤其是生成对抗网络(GAN)。但是,现有的基于GAN的模型通常会遇到众所周知的模式崩溃问题。当发生模式崩溃时,基于GAN的模型将无法产生正确的字体。为了解决这个问题,我们介绍了一个单位中风编码和一些半监督的方案(即,以半监督信息为半监督信息)分别探索了汉字的本地和全球结构信息,这些信息是由直觉和角色直接体现了某些本地和全球汉语模式的汉字和全球结构信息。基于这些思想,本文提出了一个称为\ textit {Strokegan+}的有效模型,该模型将stroke编码和几个弹药半监督方案结合到了Cyclegan模型中。提出的模型的有效性通过实验量证明。实验结果表明,通过引入的一位中风编码和少数射击的半监督训练方案可以有效地减轻模式崩溃问题,并且所提出的模型以四个重要的评估元素和生成字符的质量而言,提出的模型优于14个字体生成任务中最先进的模型。除了自行车之外,我们还表明,提出的想法可以适应其他现有模型以提高其性能。本文还评估了提出的模型对中国传统字体产生的拟议模型的有效性。
The generation of Chinese fonts has a wide range of applications. The currently predominated methods are mainly based on deep generative models, especially the generative adversarial networks (GANs). However, existing GAN-based models usually suffer from the well-known mode collapse problem. When mode collapse happens, the kind of GAN-based models will be failure to yield the correct fonts. To address this issue, we introduce a one-bit stroke encoding and a few-shot semi-supervised scheme (i.e., using a few paired data as semi-supervised information) to explore the local and global structure information of Chinese characters respectively, motivated by the intuition that strokes and characters directly embody certain local and global modes of Chinese characters. Based on these ideas, this paper proposes an effective model called \textit{StrokeGAN+}, which incorporates the stroke encoding and the few-shot semi-supervised scheme into the CycleGAN model. The effectiveness of the proposed model is demonstrated by amounts of experiments. Experimental results show that the mode collapse issue can be effectively alleviated by the introduced one-bit stroke encoding and few-shot semi-supervised training scheme, and that the proposed model outperforms the state-of-the-art models in fourteen font generation tasks in terms of four important evaluation metrics and the quality of generated characters. Besides CycleGAN, we also show that the proposed idea can be adapted to other existing models to improve their performance. The effectiveness of the proposed model for the zero-shot traditional Chinese font generation is also evaluated in this paper.