论文标题
学习调节随机权重:神经调节启发的神经网络,以进行有效的持续学习
Learning to Modulate Random Weights: Neuromodulation-inspired Neural Networks For Efficient Continual Learning
论文作者
论文摘要
现有的持续学习(CL)方法致力于通过利用正则化方法,重播缓冲区和特定于任务的组件来解决灾难性遗忘。但是,现实的CL解决方案不仅必须由灾难性遗忘的指标来塑造,而且还必须通过计算效率和运行时间来塑造。在这里,我们介绍了一种受生物神经系统中神经调节启发的新型神经网络架构,以经济有效地解决灾难性遗忘,并为解释学习的表示形式提供了新的途径。神经调节是一种生物学机制,在机器学习中受到了有限的关注。它可以动态控制和微调实时的突触动态,以跟踪不同行为环境的需求。受此启发的启发,我们提出的架构学习了每个任务上下文的一组相对较小的参数集,这些参数\ emph {neuroModululy}}不变的,随机的权重转换输入的活动。我们表明,尽管可学习的参数数量很少,但这种方法的每项任务具有强大的学习成绩。此外,由于上下文向量是如此紧凑,因此可以同时存储多个网络,没有干扰和几乎没有空间足迹,从而完全消除了灾难性的遗忘和加速训练过程。
Existing Continual Learning (CL) approaches have focused on addressing catastrophic forgetting by leveraging regularization methods, replay buffers, and task-specific components. However, realistic CL solutions must be shaped not only by metrics of catastrophic forgetting but also by computational efficiency and running time. Here, we introduce a novel neural network architecture inspired by neuromodulation in biological nervous systems to economically and efficiently address catastrophic forgetting and provide new avenues for interpreting learned representations. Neuromodulation is a biological mechanism that has received limited attention in machine learning; it dynamically controls and fine tunes synaptic dynamics in real time to track the demands of different behavioral contexts. Inspired by this, our proposed architecture learns a relatively small set of parameters per task context that \emph{neuromodulates} the activity of unchanging, randomized weights that transform the input. We show that this approach has strong learning performance per task despite the very small number of learnable parameters. Furthermore, because context vectors are so compact, multiple networks can be stored concurrently with no interference and little spatial footprint, thus completely eliminating catastrophic forgetting and accelerating the training process.