论文标题

低资源分类的概念匹配

Concept Matching for Low-Resource Classification

论文作者

Errica, Federico, Denoyer, Ludovic, Edizel, Bora, Petroni, Fabio, Plachouras, Vassilis, Silvestri, Fabrizio, Riedel, Sebastian

论文摘要

我们提出了一个模型,以在很少的培训数据的存在下处理分类任务。为此,我们将精确匹配的概念与理论上声音机制相匹配,该机制计算在输入空间中匹配的概率。重要的是,该模型学会着眼于与手头任务相关的输入要素。通过利用突出显示培训数据的一部分,促进错误的技术指导了学习过程。实际上,它增加了与输入的相关部分相关的错误。文本分类任务的显着结果证实了在平衡和不平衡案例中提出的方法的好处,因此在标记新示例时是实际使用的。此外,通过检查其权重,通常可以收集对模型学到的知识的见解。

We propose a model to tackle classification tasks in the presence of very little training data. To this aim, we approximate the notion of exact match with a theoretically sound mechanism that computes a probability of matching in the input space. Importantly, the model learns to focus on elements of the input that are relevant for the task at hand; by leveraging highlighted portions of the training data, an error boosting technique guides the learning process. In practice, it increases the error associated with relevant parts of the input by a given factor. Remarkable results on text classification tasks confirm the benefits of the proposed approach in both balanced and unbalanced cases, thus being of practical use when labeling new examples is expensive. In addition, by inspecting its weights, it is often possible to gather insights on what the model has learned.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源