论文标题

言语的自然主义头部运动产生

Naturalistic Head Motion Generation from Speech

论文作者

Mittal, Trisha, Aldeneh, Zakaria, Fedzechkina, Masha, Ranjan, Anurag, Theobald, Barry-John

论文摘要

为了提供丰富的互动体验,必须将自然头运动综合以伴随体现的对话代理。大多数先前的作品通过使用客观度量进行比较,评估了产生的头部运动的质量。然而,有许多合理的头部运动序列伴随着语音发言。在这项工作中,我们研究了从生成模型中采样的头部运动的感知质量的变化。我们表明,尽管提供了更多多样化的头部动作,但生成模型仍会产生不同程度的感知质量的动作。我们最终表明,以前研究中常用的客观指标不能准确反映产生的头部运动的感知质量。这些结果为将来的工作开辟了一个有趣的途径,以调查与人类对质量认识相关的更好的客观指标。

Synthesizing natural head motion to accompany speech for an embodied conversational agent is necessary for providing a rich interactive experience. Most prior works assess the quality of generated head motion by comparing them against a single ground-truth using an objective metric. Yet there are many plausible head motion sequences to accompany a speech utterance. In this work, we study the variation in the perceptual quality of head motions sampled from a generative model. We show that, despite providing more diverse head motions, the generative model produces motions with varying degrees of perceptual quality. We finally show that objective metrics commonly used in previous research do not accurately reflect the perceptual quality of generated head motions. These results open an interesting avenue for future work to investigate better objective metrics that correlate with human perception of quality.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源