论文标题

NAPG:用于混合表格文本问题的非自动入学计划生成回答

NAPG: Non-Autoregressive Program Generation for Hybrid Tabular-Textual Question Answering

论文作者

Zhang, Tengxun, Xu, Hongfei, van Genabith, Josef, Xiong, Deyi, Zan, Hongying

论文摘要

混合表格文本问题回答(QA)需要从异质信息中推理,并且推理的类型主要分为数值推理和跨度提取。当前的数值推理方法自动加压解码程序序列,每个解码步骤都会产生操作员或操作数。但是,逐步解码会遭受暴露偏差的影响,并且随着由于错误传播而展开的解码步骤,程序生成的准确性急剧下降。在本文中,我们提出了一个非自动进取的计划生成框架,该框架独立生成包含操作员和操作数的完整程序元组,可以解决错误传播问题,同时显着提高程序生成的速度。 Experiments on the ConvFinQA and MultiHiertt datasets show that our non-autoregressive program generation method can bring about substantial improvements over the strong FinQANet (+5.06 Exe Acc and +4.80 Prog Acc points) and MT2Net (+7.97 EM and +6.38 F1 points) baselines, establishing the new state-of-the-art performance, while being much faster (21x) in program generation.最后,随着数值推理步骤数量的越来越多,我们方法的性能下降明显小于基准的性能。我们的代码即将公开可用。

Hybrid tabular-textual question answering (QA) requires reasoning from heterogeneous information, and the types of reasoning are mainly divided into numerical reasoning and span extraction. Current numerical reasoning methods autoregressively decode program sequences, and each decoding step produces either an operator or an operand. However, the step-by-step decoding suffers from exposure bias, and the accuracy of program generation drops sharply as the decoding steps unfold due to error propagation. In this paper, we propose a non-autoregressive program generation framework, which independently generates complete program tuples containing both operators and operands, can address the error propagation issue while significantly boosting the speed of program generation. Experiments on the ConvFinQA and MultiHiertt datasets show that our non-autoregressive program generation method can bring about substantial improvements over the strong FinQANet (+5.06 Exe Acc and +4.80 Prog Acc points) and MT2Net (+7.97 EM and +6.38 F1 points) baselines, establishing the new state-of-the-art performance, while being much faster (21x) in program generation. Finally, with increasing numbers of numerical reasoning steps the performance drop of our method is significantly smaller than that of the baselines. Our code will be publicly available soon.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源