论文标题
为数据有效语言获取的自我播放
Self-play for Data Efficient Language Acquisition
论文作者
论文摘要
在交流时,人们在对话角色上始终保持行为:人们理解他们说的话,并能够产生他们听到的单词。迄今为止,为语言任务开发的人工代理缺乏这种对称性,这意味着接受过语言的训练的代理人无法理解它,反之亦然。在这项工作中,我们利用了交流的对称性质,以提高学习代理中语言获取的效率和质量。具体而言,我们考虑代理必须学会以现有语言来理解和生成单词的设置,但是假设访问与该语言的“ Oracle”扬声器的互动非常有限。我们表明,使用自我戏剧作为直接监督的替代品,使代理商可以跨角色转移其知识(例如,作为听众的培训,但作为演讲者进行测试),并仅使用与甲骨文的几次互动来更好地推断地面真相词典。
When communicating, people behave consistently across conversational roles: People understand the words they say and are able to produce the words they hear. To date, artificial agents developed for language tasks have lacked such symmetry, meaning agents trained to produce language are unable to understand it and vice-versa. In this work, we exploit the symmetric nature of communication in order to improve both the efficiency and quality of language acquisition in learning agents. Specifically, we consider the setting in which an agent must learn to both understand and generate words in an existing language, but with the assumption that access to interaction with "oracle" speakers of the language is very limited. We show that using self-play as a substitute for direct supervision enables the agent to transfer its knowledge across roles (e.g. training as a listener but testing as a speaker) and make better inferences about the ground truth lexicon using only a handful of interactions with the oracle.