论文标题
笔记本电脑合奏性能系统使用经常性神经网络
A Laptop Ensemble Performance System using Recurrent Neural Networks
论文作者
论文摘要
在音乐领域应用机器学习技术的普及已创造了可自由访问的预训练的神经网络(NN)模型的固有可用性,该模型准备在创意应用程序中使用。这项工作概述了一种辅助工具的形式,旨在实施一个辅助工具,该工具为笔记本电脑合奏的现场表演而设计。主要目的是利用现成的预训练的NN模型,作为帮助单个表演者作为音乐新手,希望与更有经验的表演者互动,或者作为通过新形式的创意表达来扩展音乐可能性的工具。该系统扩展了在不同研究领域发现的各种想法,包括音乐表达的新接口,生成音乐和群体性能,以生成通过Web浏览器界面提供的网络性能解决方案。该系统的最终实现为表演者提供了高级和低级控件的混合物,以实时影响局部运行的NN模型的音符序列,还允许表演者定义他们与辅助生成模型的参与水平。播放了两次测试表演,该系统显示出在四分钟的演出中可行地支持四个表演者,同时制作了音乐凝聚力和引人入胜的音乐。系统设计的迭代涉及在现场音乐上下文中使用JavaScript环境来使用JavaScript环境的技术约束,这主要来自不可避免的处理开销。
The popularity of applying machine learning techniques in musical domains has created an inherent availability of freely accessible pre-trained neural network (NN) models ready for use in creative applications. This work outlines the implementation of one such application in the form of an assistance tool designed for live improvisational performances by laptop ensembles. The primary intention was to leverage off-the-shelf pre-trained NN models as a basis for assisting individual performers either as musical novices looking to engage with more experienced performers or as a tool to expand musical possibilities through new forms of creative expression. The system expands upon a variety of ideas found in different research areas including new interfaces for musical expression, generative music and group performance to produce a networked performance solution served via a web-browser interface. The final implementation of the system offers performers a mixture of high and low-level controls to influence the shape of sequences of notes output by locally run NN models in real time, also allowing performers to define their level of engagement with the assisting generative models. Two test performances were played, with the system shown to feasibly support four performers over a four minute piece while producing musically cohesive and engaging music. Iterations on the design of the system exposed technical constraints on the use of a JavaScript environment for generative models in a live music context, largely derived from inescapable processing overheads.