论文标题
Convlab-2:用于构建,评估和诊断对话系统的开源工具包
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
论文作者
论文摘要
我们提出了Convlab-2,这是一种开源工具包,使研究人员能够通过最先进的模型构建面向任务的对话系统,进行端到端评估,并诊断系统的弱点。作为Convlab的继任者(Lee等人,2019b),Convlab-2继承了Convlab的框架,但整合了更强大的对话模型并支持更多的数据集。此外,我们开发了一个分析工具和一种交互式工具,以帮助研究人员诊断对话系统。该分析工具提供了丰富的统计数据,并总结了模拟对话中的常见错误,这有助于误差分析和系统改进。交互式工具提供了一个用户界面,该界面允许开发人员通过与系统进行交互并修改每个系统组件的输出来诊断组装对话系统。
We present ConvLab-2, an open-source toolkit that enables researchers to build task-oriented dialogue systems with state-of-the-art models, perform an end-to-end evaluation, and diagnose the weakness of systems. As the successor of ConvLab (Lee et al., 2019b), ConvLab-2 inherits ConvLab's framework but integrates more powerful dialogue models and supports more datasets. Besides, we have developed an analysis tool and an interactive tool to assist researchers in diagnosing dialogue systems. The analysis tool presents rich statistics and summarizes common mistakes from simulated dialogues, which facilitates error analysis and system improvement. The interactive tool provides a user interface that allows developers to diagnose an assembled dialogue system by interacting with the system and modifying the output of each system component.