论文标题

软误差影响神经网络的统计建模

Statistical Modeling of Soft Error Influence on Neural Networks

论文作者

Huang, Haitong, Xue, Xinghua, Liu, Cheng, Wang, Ying, Luo, Tao, Cheng, Long, Li, Huawei, Li, Xiaowei

论文摘要

大型VLSI电路中的软误差对计算和记忆密集型神经网络(NN)处理产生了巨大影响。了解软错误对NNS的影响对于防止可靠的NN处理的软错误至关重要。先前的工作主要依靠故障模拟来分析软误差对NN处理的影响。它们是准确的,但通常特定于有限的错误和NN模型配置,这是由于较慢的仿真速度,尤其是对于大型NN模型和数据集而导致的。通过观察到软误差的影响会在许多神经元中传播并累积,我们建议根据中心限制定理,通过正态分布模型来表征每个神经元对每个神经元的数据扰动,并开发一系列统计模型,以分析一般软误差下NN模型的行为。统计模型不仅揭示了软误差与NN模型精度之间的相关性,还揭示了NN参数(例如量化和体系结构)如何影响NNS的可靠性。将所提出的模型与故障模拟进行比较,并全面验证。此外,我们观察到,表征软误差影响的统计模型也可以用于预测许多情况下的故障模拟结果,并且我们探索了提出的统计模型的使用来加速NNS的故障模拟。根据我们的实验,加速故障模拟显示了几乎两个数量级的速度,对基线故障模拟的模拟精度损失微不足道。

Soft errors in large VLSI circuits pose dramatic influence on computing- and memory-intensive neural network (NN) processing. Understanding the influence of soft errors on NNs is critical to protect against soft errors for reliable NN processing. Prior work mainly rely on fault simulation to analyze the influence of soft errors on NN processing. They are accurate but usually specific to limited configurations of errors and NN models due to the prohibitively slow simulation speed especially for large NN models and datasets. With the observation that the influence of soft errors propagates across a large number of neurons and accumulates as well, we propose to characterize the soft error induced data disturbance on each neuron with normal distribution model according to central limit theorem and develop a series of statistical models to analyze the behavior of NN models under soft errors in general. The statistical models reveal not only the correlation between soft errors and NN model accuracy, but also how NN parameters such as quantization and architecture affect the reliability of NNs. The proposed models are compared with fault simulation and verified comprehensively. In addition, we observe that the statistical models that characterize the soft error influence can also be utilized to predict fault simulation results in many cases and we explore the use of the proposed statistical models to accelerate fault simulations of NNs. According to our experiments, the accelerated fault simulation shows almost two orders of magnitude speedup with negligible simulation accuracy loss over the baseline fault simulations.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源