论文标题

机器人学习理论通过自我观察:利用意图信号协同作用

Robot Learning Theory of Mind through Self-Observation: Exploiting the Intentions-Beliefs Synergy

论文作者

Bianco, Francesca, Ognibene, Dimitri

论文摘要

在复杂的环境中,人类感觉系统达到了极限,我们的行为是由我们对周围世界状况的信念的强烈驱动。因此,一般来说,掌握他人的信念,意图或精神状态,可以在自然背景下进行更有效的社交互动。然而,这些变量无法直接观察到。心理理论(汤姆),将归因于其他代理人的信念,意图或精神状态的能力, 是人类社会互动的关键特征,并已成为机器人社区的关注。最近,已经引入了能够学习汤姆的新模型。在本文中,我们展示了学习预测低级心理状态(例如意图和目标)之间的协同作用,以及归因于信念等高级精神状态。假设可以通过观察自己的决策和信念估计过程,并使用简单的馈送深度学习模型来观察自己的决策和信念估计过程,那么我们表明,如果学习信念归因于行动和意图预测,就可以获取更快,更准确的预测。我们表明,学习成绩即使观察具有不同决策过程的代理,并且在观察信念驱动的行为块时也更高。我们建议我们的建筑方法与未来的自适应社会机器人的设计有关,这些机器人应该能够自主理解和帮助人类合作伙伴在新颖的自然环境和任务中。

In complex environments, where the human sensory system reaches its limits, our behaviour is strongly driven by our beliefs about the state of the world around us. Accessing others' beliefs, intentions, or mental states in general, could thus allow for more effective social interactions in natural contexts. Yet these variables are not directly observable. Theory of Mind (TOM), the ability to attribute to other agents' beliefs, intentions, or mental states in general, is a crucial feature of human social interaction and has become of interest to the robotics community. Recently, new models that are able to learn TOM have been introduced. In this paper, we show the synergy between learning to predict low-level mental states, such as intentions and goals, and attributing high-level ones, such as beliefs. Assuming that learning of beliefs can take place by observing own decision and beliefs estimation processes in partially observable environments and using a simple feed-forward deep learning model, we show that when learning to predict others' intentions and actions, faster and more accurate predictions can be acquired if beliefs attribution is learnt simultaneously with action and intentions prediction. We show that the learning performance improves even when observing agents with a different decision process and is higher when observing beliefs-driven chunks of behaviour. We propose that our architectural approach can be relevant for the design of future adaptive social robots that should be able to autonomously understand and assist human partners in novel natural environments and tasks.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源