半监督的人类姿势估计在艺术历史图像中

论文标题

半监督的人类姿势估计在艺术历史图像中

Semi-supervised Human Pose Estimation in Art-historical Images

论文作者

Springstein, Matthias, Schneider, Stefanie, Althaus, Christian, Ewerth, Ralph

论文摘要

自从17世纪以来，理论上就建立了作为非语言交流语言的手势。但是，它与视觉艺术的相关性仅偶尔表达。这可能主要是由于传统上必须手工处理的大量数据所致。但是，随着数字化的稳定进步，越来越多的历史文物被索引并提供给公众，从而需要自动检索具有相似身体星座或姿势的艺术历史图案。由于艺术领域因其风格差异而与现有的人类姿势估计的现有数据集有显着不同，因此这提出了新的挑战。在本文中，我们提出了一种新颖的方法来估计艺术史图像中人类姿势。与以前试图用预训练模型或通过样式转移弥合域间隙的工作相反，我们建议对对象和关键点检测进行半监督学习。此外，我们引入了一个新颖的特定领域的艺术数据集，其中包括人物的边界框和关键点注释。与使用预训练模型或样式转移的方法相比，我们的方法取得了明显更好的结果。

Gesture as language of non-verbal communication has been theoretically established since the 17th century. However, its relevance for the visual arts has been expressed only sporadically. This may be primarily due to the sheer overwhelming amount of data that traditionally had to be processed by hand. With the steady progress of digitization, though, a growing number of historical artifacts have been indexed and made available to the public, creating a need for automatic retrieval of art-historical motifs with similar body constellations or poses. Since the domain of art differs significantly from existing real-world data sets for human pose estimation due to its style variance, this presents new challenges. In this paper, we propose a novel approach to estimate human poses in art-historical images. In contrast to previous work that attempts to bridge the domain gap with pre-trained models or through style transfer, we suggest semi-supervised learning for both object and keypoint detection. Furthermore, we introduce a novel domain-specific art data set that includes both bounding box and keypoint annotations of human figures. Our approach achieves significantly better results than methods that use pre-trained models or style transfer.

下载PDF全文

下载文献需遵守相关版权规定

论文标题