论文标题
通过压缩图选择器网络回答长文档问题中的全局结构信息
Capturing Global Structural Information in Long Document Question Answering with Compressive Graph Selector Network
论文作者
论文摘要
长文档问题回答是一项具有挑战性的任务,因为它要求对长文进行复杂的推理。以前的作品通常将长文档作为非结构化平面文本,或仅考虑长文档中的本地结构。但是,这些方法通常忽略了长文档的全球结构,这对于远程理解至关重要。为了解决此问题,我们提出了压缩图选择器网络(CGSN),以压缩和迭代的方式捕获全局结构。拟议的模型主要集中在长文档问题回答的证据选择阶段。具体而言,它由三个模块组成:本地图网络,全局图网络和证据内存网络。首先,本地图网络在令牌,句子,段落和段级别中构建了块段的图形结构,以捕获文本的短期依赖性。其次,全局图网络选择性地从本地图中接收每个级别的信息,将它们压缩到全局图节点中,并将图形注意力应用于全局图节点,以迭代方式构建对整个文本的远程推理。第三,证据内存网络旨在通过在先前的步骤中保存所选结果来减轻证据选择中的冗余问题。广泛的实验表明,所提出的模型优于两个数据集上的先前方法。
Long document question answering is a challenging task due to its demands for complex reasoning over long text. Previous works usually take long documents as non-structured flat texts or only consider the local structure in long documents. However, these methods usually ignore the global structure of the long document, which is essential for long-range understanding. To tackle this problem, we propose Compressive Graph Selector Network (CGSN) to capture the global structure in a compressive and iterative manner. The proposed model mainly focuses on the evidence selection phase of long document question answering. Specifically, it consists of three modules: local graph network, global graph network and evidence memory network. Firstly, the local graph network builds the graph structure of the chunked segment in token, sentence, paragraph and segment levels to capture the short-term dependency of the text. Secondly, the global graph network selectively receives the information of each level from the local graph, compresses them into the global graph nodes and applies graph attention to the global graph nodes to build the long-range reasoning over the entire text in an iterative way. Thirdly, the evidence memory network is designed to alleviate the redundancy problem in the evidence selection by saving the selected result in the previous steps. Extensive experiments show that the proposed model outperforms previous methods on two datasets.