论文标题

深神经网络中贝叶斯的后验有多好?

How Good is the Bayes Posterior in Deep Neural Networks Really?

论文作者

Wenzel, Florian, Roth, Kevin, Veeling, Bastiaan S., Świątkowski, Jakub, Tran, Linh, Mandt, Stephan, Snoek, Jasper, Salimans, Tim, Jenatton, Rodolphe, Nowozin, Sebastian

论文摘要

在过去的五年中,贝叶斯深度学习社区已经发展了越来越准确,有效的近似推理程序,从而允许在深神经网络中推断贝叶斯的推断。然而,尽管有这种算法的进展以及改善不确定性量化和样本效率的希望,但截至2020年初 - - 尚未在工业实践中宣传贝叶斯神经网络的部署。在这项工作中,我们对流行深层神经网络中贝叶斯后期的当前理解产生了怀疑:我们通过仔细的MCMC抽样证明,与更简单的方法相比,贝叶斯后验引起的后验预测在系统上更差的预测,包括从SGD获得的点估计值。此外,我们证明,通过使用过度表达证据的“冷后验”,可以显着提高预测性能。如此冷的后代急剧偏离了贝叶斯范式,但在贝叶斯深度学习论文中通常被用作启发式。我们提出了几种假设,可以解释冷后代并通过实验评估这些假设。我们的工作质疑贝叶斯深度学习中准确的后近似值的目标:如果真正的贝叶斯后部很差,那么更准确的近似值是什么?取而代之的是,我们认为专注于理解改善冷后代性能的起源是及时的。

During the past five years the Bayesian deep learning community has developed increasingly accurate and efficient approximate inference procedures that allow for Bayesian inference in deep neural networks. However, despite this algorithmic progress and the promise of improved uncertainty quantification and sample efficiency there are---as of early 2020---no publicized deployments of Bayesian neural networks in industrial practice. In this work we cast doubt on the current understanding of Bayes posteriors in popular deep neural networks: we demonstrate through careful MCMC sampling that the posterior predictive induced by the Bayes posterior yields systematically worse predictions compared to simpler methods including point estimates obtained from SGD. Furthermore, we demonstrate that predictive performance is improved significantly through the use of a "cold posterior" that overcounts evidence. Such cold posteriors sharply deviate from the Bayesian paradigm but are commonly used as heuristic in Bayesian deep learning papers. We put forward several hypotheses that could explain cold posteriors and evaluate the hypotheses through experiments. Our work questions the goal of accurate posterior approximations in Bayesian deep learning: If the true Bayes posterior is poor, what is the use of more accurate approximations? Instead, we argue that it is timely to focus on understanding the origin of the improved performance of cold posteriors.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源