论文标题
将人类对音乐性的看法与预测编码模型中的预测有关
Relating Human Perception of Musicality to Prediction in a Predictive Coding Model
论文作者
论文摘要
我们探讨了受到对人类音乐感知建模的预测编码启发的神经网络的使用。该网络是基于分层视觉皮层中复发相互作用的计算神经科学理论开发的。当使用自我监督的学习接受视频数据训练时,该模型表现出与人类视觉错觉一致的行为。在这里,我们适应了该网络以建模分层听觉系统,并研究它是否会与人类在一组随机音调序列的音乐性方面做出类似的选择。当模型接受大量器乐古典音乐和作为MEL频谱图呈现的流行旋律训练时,它对随机音调序列的预测错误显示出更大的预测错误,这些序列被人类受试者评级较少。我们发现预测错误取决于有关后续注意,音高间隔和时间上下文的信息量。我们的发现表明,可预测性与人类对音乐性的看法相关,并且在音乐中训练的预测编码神经网络可用于表征有助于人类对音乐感知的功能和图案。
We explore the use of a neural network inspired by predictive coding for modeling human music perception. This network was developed based on the computational neuroscience theory of recurrent interactions in the hierarchical visual cortex. When trained with video data using self-supervised learning, the model manifests behaviors consistent with human visual illusions. Here, we adapt this network to model the hierarchical auditory system and investigate whether it will make similar choices to humans regarding the musicality of a set of random pitch sequences. When the model is trained with a large corpus of instrumental classical music and popular melodies rendered as mel spectrograms, it exhibits greater prediction errors for random pitch sequences that are rated less musical by human subjects. We found that the prediction error depends on the amount of information regarding the subsequent note, the pitch interval, and the temporal context. Our findings suggest that predictability is correlated with human perception of musicality and that a predictive coding neural network trained on music can be used to characterize the features and motifs contributing to human perception of music.