关于稀疏连通性，对抗性鲁棒性和人造神经元的新型模型

论文标题

关于稀疏连通性，对抗性鲁棒性和人造神经元的新型模型

On sparse connectivity, adversarial robustness, and a novel model of the artificial neuron

论文作者

Bochkanov, Sergey

论文摘要

深度神经网络几乎在所有感知基准上都达到了人类水平的准确性。有趣的是，这些进步是使用了几十年历史的两个想法进行的：（a）基于线性夏季和（b）SGD培训的人造神经元。但是，超出准确性的重要指标：计算效率和稳定性针对对抗性扰动。在本文中，我们提出了两种紧密连接的方法，以改善这些指标在轮廓识别任务上：（a）一种人造神经元的新型模型，一种“强型神经元”，具有低硬件的要求和对对抗性扰动的固有性固有性，并且（b）具有新颖的构建训练algorithm与$ o（1）$ O（1）$ o（1）$ o（1）$ o（1）$ o（1）$ o（1）$ o（1）。我们通过对SVHN和GTSRB基准测试的实验证明了我们的方法的可行性。我们的操作计数降低了令人印象深刻的10倍100倍（与其他稀疏方法相比，与密集网络相比，100倍）和硬件需求（使用8位固定点数学）的大幅度降低，而模型准确性没有降低。仅依靠强神经元的稳健性，实现了针对对抗性扰动（超过对抗性训练的较高稳定性）。我们还证明，我们强神经元的组成块是唯一具有针对对抗攻击的完美稳定性的激活功能。

Deep neural networks have achieved human-level accuracy on almost all perceptual benchmarks. It is interesting that these advances were made using two ideas that are decades old: (a) an artificial neuron based on a linear summator and (b) SGD training. However, there are important metrics beyond accuracy: computational efficiency and stability against adversarial perturbations. In this paper, we propose two closely connected methods to improve these metrics on contour recognition tasks: (a) a novel model of an artificial neuron, a "strong neuron," with low hardware requirements and inherent robustness against adversarial perturbations and (b) a novel constructive training algorithm that generates sparse networks with $O(1)$ connections per neuron. We demonstrate the feasibility of our approach through experiments on SVHN and GTSRB benchmarks. We achieved an impressive 10x-100x reduction in operations count (10x when compared with other sparsification approaches, 100x when compared with dense networks) and a substantial reduction in hardware requirements (8-bit fixed-point math was used) with no reduction in model accuracy. Superior stability against adversarial perturbations (exceeding that of adversarial training) was achieved without any counteradversarial measures, relying on the robustness of strong neurons alone. We also proved that constituent blocks of our strong neuron are the only activation functions with perfect stability against adversarial attacks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题