论文标题

通过代表学习在现实世界中概括

Generalizing in the Real World with Representation Learning

论文作者

Maharaj, Tegan

论文摘要

机器学习(ML)正式化了使计算机在一组数据示例中根据某些度量的优化经验从经验中学习的问题。这与需要提前指定的行为(例如,通过硬编码规则)形成鲜明对比。这个问题的形式化使许多现实影响的应用程序都取得了巨大进展,包括翻译,语音识别,自动驾驶汽车和药物发现。但是,这种形式主义的实际实例化提出了许多假设 - 例如,数据是I.I.D。:独立且相同分布的 - 很少研究其健全性。在这么短的时间内取得了长足的进步,该领域已经制定了许多规范和临时标准,重点是相对较小的问题设置。随着ML的应用,尤其是在人工智能(AI)系统中,在现实世界中变得更加普遍,我们需要批判性地检查这些假设,规范和问题设置,以及已成为事实上标准的方法。我们仍然不了解如何以及为什么接受随机梯度下降训练的深层网络能够像他们一样概括,为什么在这样做时失败,以及如何在分发数据范围内执行。在这篇论文中,我涵盖了一些工作,以更好地理解深网的概括,确定几种假设和问题设置无法推广到现实世界,并提出解决实践中这些失败的方法。

Machine learning (ML) formalizes the problem of getting computers to learn from experience as optimization of performance according to some metric(s) on a set of data examples. This is in contrast to requiring behaviour specified in advance (e.g. by hard-coded rules). Formalization of this problem has enabled great progress in many applications with large real-world impact, including translation, speech recognition, self-driving cars, and drug discovery. But practical instantiations of this formalism make many assumptions - for example, that data are i.i.d.: independent and identically distributed - whose soundness is seldom investigated. And in making great progress in such a short time, the field has developed many norms and ad-hoc standards, focused on a relatively small range of problem settings. As applications of ML, particularly in artificial intelligence (AI) systems, become more pervasive in the real world, we need to critically examine these assumptions, norms, and problem settings, as well as the methods that have become de-facto standards. There is much we still do not understand about how and why deep networks trained with stochastic gradient descent are able to generalize as well as they do, why they fail when they do, and how they will perform on out-of-distribution data. In this thesis I cover some of my work towards better understanding deep net generalization, identify several ways assumptions and problem settings fail to generalize to the real world, and propose ways to address those failures in practice.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源