以自然的聚类来理解深度学习

论文标题

以自然的聚类来理解深度学习

Towards understanding deep learning with the natural clustering prior

论文作者

Carbonnelle, Simon

论文摘要

融入机器学习系统设计中的先验知识（又名先验）强烈影响其泛化能力。在深度学习的具体背景下，其中一些先验的理解很众所周知，因为它们从涉及深度学习设计的生物学大脑的成功启发式方法和暂定近似。通过监督图像分类问题的镜头，本论文研究了由三个陈述组成的自然聚类的隐式整合：（i）自然图像表现出丰富的簇结构，（ii）图像类别类别由多个群集组成，（iii）每个群集包含一个类别的示例。将类别分解为多个集群的分解意味着，监督的深度学习系统可以从无监督的聚类中受益，以定义适当的决策范围。因此，本论文试图在深度学习系统中确定隐式聚类能力，机制和超参数，并评估它们在解释这些系统的概括能力方面的相关性。我们通过对训练动力学以及深层神经网络的神经元和层级表示的广泛实证研究来做到这一点。由此产生的实验收集为自然聚类的相关性提供了初步证据，以理解深度学习。

The prior knowledge (a.k.a. priors) integrated into the design of a machine learning system strongly influences its generalization abilities. In the specific context of deep learning, some of these priors are poorly understood as they implicitly emerge from the successful heuristics and tentative approximations of biological brains involved in deep learning design. Through the lens of supervised image classification problems, this thesis investigates the implicit integration of a natural clustering prior composed of three statements: (i) natural images exhibit a rich clustered structure, (ii) image classes are composed of multiple clusters and (iii) each cluster contains examples from a single class. The decomposition of classes into multiple clusters implies that supervised deep learning systems could benefit from unsupervised clustering to define appropriate decision boundaries. Hence, this thesis attempts to identify implicit clustering abilities, mechanisms and hyperparameters in deep learning systems and evaluate their relevance for explaining the generalization abilities of these systems. We do so through an extensive empirical study of the training dynamics as well as the neuron- and layer-level representations of deep neural networks. The resulting collection of experiments provides preliminary evidence for the relevance of the natural clustering prior for understanding deep learning.

下载PDF全文

下载文献需遵守相关版权规定

论文标题