论文标题
半监督学习,以通过神经图共识来理解多任务场景
Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus
论文作者
论文摘要
我们通过在神经网络图中找到共识,在世界上多种视觉解释的背景下解决了半监督学习的具有挑战性的问题。每个图节点都是场景解释层,而每个边缘都是一个深网,它从另一个节点将一个节点的一个节点转换为另一个节点。在有监督的阶段期间,边缘网络进行独立培训。在下一个无监督的阶段边缘网络中,网络是通过在到达网络开始和结束节点的多个路径之间达成共识所提供的伪地真相。这些路径充当任何给定边缘的集合教师,并且使用强有力的共识用于高信心监督信号。在几代人中重复了无监督的学习过程,其中每个边缘成为一个“学生”,也是不同合奏“老师”的一部分,以培训其他学生。通过在未知标签面前优化不同路径之间的这种共识,该图在多个解释和世代上达到了一致性和鲁棒性。我们给出了拟议思想的理论理由,并在大型数据集上验证它。我们展示了如何通过我们的图表中的自我监督共识来有效地了解不同表示,例如深度,语义分割,表面正态和姿势的预测。我们还与多任务和半监督学习的最新方法进行比较,并显示出卓越的表现。
We address the challenging problem of semi-supervised learning in the context of multiple visual interpretations of the world by finding consensus in a graph of neural networks. Each graph node is a scene interpretation layer, while each edge is a deep net that transforms one layer at one node into another from a different node. During the supervised phase edge networks are trained independently. During the next unsupervised stage edge nets are trained on the pseudo-ground truth provided by consensus among multiple paths that reach the nets' start and end nodes. These paths act as ensemble teachers for any given edge and strong consensus is used for high-confidence supervisory signal. The unsupervised learning process is repeated over several generations, in which each edge becomes a "student" and also part of different ensemble "teachers" for training other students. By optimizing such consensus between different paths, the graph reaches consistency and robustness over multiple interpretations and generations, in the face of unknown labels. We give theoretical justifications of the proposed idea and validate it on a large dataset. We show how prediction of different representations such as depth, semantic segmentation, surface normals and pose from RGB input could be effectively learned through self-supervised consensus in our graph. We also compare to state-of-the-art methods for multi-task and semi-supervised learning and show superior performance.