学习有效的上下文响应匹配模型，并使用自制任务进行基于检索的对话

论文标题

学习有效的上下文响应匹配模型，并使用自制任务进行基于检索的对话

Learning an Effective Context-Response Matching Model with Self-Supervised Tasks for Retrieval-based Dialogues

论文作者

Xu, Ruijian, Tao, Chongyang, Jiang, Daxin, Zhao, Xueliang, Zhao, Dongyan, Yan, Rui

论文摘要

建立一个智能对话系统，能够根据多转弯环境选择适当的响应是一项艰巨的任务。现有的研究重点是建立具有各种神经体系结构或PLM的上下文响应匹配模型，并通常通过单个响应预测任务进行学习。这些方法忽略了对话数据中包含的许多潜在培训信号，这可能对上下文理解有益，并为响应预测提供了更好的功能。此外，从传统方式监督的现有对话系统中检索到的响应仍然面临着一些关键挑战，包括不一致和不一致。为了解决这些问题，在本文中，我们建议通过基于预先训练的语言模型为对话数据设计的辅助自我监督任务学习上下文响应匹配模型。具体而言，我们介绍了四个自我监督的任务，包括下一个会话预测，话语恢复，不一致的检测和一致性歧视，并以多任务方式与这些辅助任务共同训练基于PLM的响应选择模型。通过这种方式，辅助任务可以指导匹配模型的学习，以实现更好的本地最佳选择并选择更正确的响应。两个基准的实验结果表明，在基于检索的对话过程中，提出的辅助自我监督任务可以显着改善多转响应的选择，并且我们的模型在两个数据集中都取得了新的最新结果。

Building an intelligent dialogue system with the ability to select a proper response according to a multi-turn context is a great challenging task. Existing studies focus on building a context-response matching model with various neural architectures or PLMs and typically learning with a single response prediction task. These approaches overlook many potential training signals contained in dialogue data, which might be beneficial for context understanding and produce better features for response prediction. Besides, the response retrieved from existing dialogue systems supervised by the conventional way still faces some critical challenges, including incoherence and inconsistency. To address these issues, in this paper, we propose learning a context-response matching model with auxiliary self-supervised tasks designed for the dialogue data based on pre-trained language models. Specifically, we introduce four self-supervised tasks including next session prediction, utterance restoration, incoherence detection and consistency discrimination, and jointly train the PLM-based response selection model with these auxiliary tasks in a multi-task manner. By this means, the auxiliary tasks can guide the learning of the matching model to achieve a better local optimum and select a more proper response. Experiment results on two benchmarks indicate that the proposed auxiliary self-supervised tasks bring significant improvement for multi-turn response selection in retrieval-based dialogues, and our model achieves new state-of-the-art results on both datasets.

下载PDF全文

下载文献需遵守相关版权规定

论文标题