论文标题
使用自由能原理在体现系统上的感觉运动视觉感知
Sensorimotor Visual Perception on Embodied System Using Free Energy Principle
论文作者
论文摘要
我们提出了一个基于自由能原理(FEP)的体现系统,以进行感觉运动视觉感知。我们在使用MNIST数据集的角色识别任务中对其进行了评估。尽管FEP成功地描述了一条规则,即生存事物是数学上遵守的,并声称生物系统继续改变其内部模型和行为,以最大程度地减少预测感觉输入的差异,但它不足以模拟感觉运动视觉感知。系统的实施例是实现感觉运动视觉感知的关键。提出的体现系统由身体和内存配置。人体具有控制眼睛凝视方向的眼部运动系统,这意味着眼睛只能观察到环境中的一个小针对性区域。内存不是摄影的,而是一种生成模型,该模型具有各种自动编码器,其中包含有关环境的先验知识,并且该知识已被分类。通过限制身体和记忆能力并根据FEP运行,体现的系统反复采取行动以基于未来感觉输入的各种潜力获得下一个感觉输入。在评估中,环境的推论表示为字符的后验分布(0-9)。随着重复的数量增加,注意区域不断移动,逐渐减少了字符的不确定性。最后,正确字符的概率成为角色中最高的。改变初始注意力位置提供了不同的最终分布,表明拟议的系统具有确认偏差。
We propose an embodied system based on the free energy principle (FEP) for sensorimotor visual perception. We evaluated it in a character-recognition task using the MNIST dataset. Although the FEP has successfully described a rule that living things obey mathematically and claims that a biological system continues to change its internal models and behaviors to minimize the difference in predicting sensory input, it is not enough to model sensorimotor visual perception. An embodiment of the system is the key to achieving sensorimotor visual perception. The proposed embodied system is configured by a body and memory. The body has an ocular motor system controlling the direction of eye gaze, which means that the eye can only observe a small focused area of the environment. The memory is not photographic, but is a generative model implemented with a variational autoencoder that contains prior knowledge about the environment, and that knowledge is classified. By limiting body and memory abilities and operating according to the FEP, the embodied system repeatedly takes action to obtain the next sensory input based on various potentials of future sensory inputs. In the evaluation, the inference of the environment was represented as an approximate posterior distribution of characters (0 - 9). As the number of repetitions increased, the attention area moved continuously, gradually reducing the uncertainty of characters. Finally, the probability of the correct character became the highest among the characters. Changing the initial attention position provides a different final distribution, suggesting that the proposed system has a confirmation bias.