论文标题

跨识别批量归一化

Cross-Iteration Batch Normalization

论文作者

Yao, Zhuliang, Cao, Yue, Zheng, Shuxin, Huang, Gao, Lin, Stephen

论文摘要

批处理标准化的一个众所周知的问题是,在小型小批量尺寸的情况下,其有效性大大降低了。当迷你批次包含很少的示例时,定义标准化的统计数据在训练迭代期间无法可靠地估计。为了解决这个问题,我们提出了交叉批准归一化(CBN),其中联合使用了多个近期迭代的示例来提高估计质量。在多次迭代中计算统计的一个挑战是,由于网络权重的变化,来自不同迭代的网络激活无法彼此相提并论。因此,我们通过基于泰勒多项式的提议的技术来补偿网络权重的变化,以便可以准确估算统计数据,并可以有效地应用批次归一化。在对象检测和图像分类的小型微型批量尺寸的情况下,发现CBN的表现优于原始批次归一化,并且直接计算了先前迭代的统计信息,而没有提出的补偿技术。代码可从https://github.com/howal/cross-iterationbatchnorm获得。

A well-known issue of Batch Normalization is its significantly reduced effectiveness in the case of small mini-batch sizes. When a mini-batch contains few examples, the statistics upon which the normalization is defined cannot be reliably estimated from it during a training iteration. To address this problem, we present Cross-Iteration Batch Normalization (CBN), in which examples from multiple recent iterations are jointly utilized to enhance estimation quality. A challenge of computing statistics over multiple iterations is that the network activations from different iterations are not comparable to each other due to changes in network weights. We thus compensate for the network weight changes via a proposed technique based on Taylor polynomials, so that the statistics can be accurately estimated and batch normalization can be effectively applied. On object detection and image classification with small mini-batch sizes, CBN is found to outperform the original batch normalization and a direct calculation of statistics over previous iterations without the proposed compensation technique. Code is available at https://github.com/Howal/Cross-iterationBatchNorm .

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源