论文标题
通过递归神经网络评分方程来解决算术单词问题
Solving Arithmetic Word Problems by Scoring Equations with Recursive Neural Networks
论文作者
论文摘要
解决算术单词问题是评估NLP系统中语言理解和推理能力的基石任务。最近的作品使用自动提取和对候选解决方案方程的排名,为算术单词问题提供答案。在这项工作中,我们探索了使用树结构化递归神经网络(Tree-RNN)配置来评分此类候选解决方案方程的新方法。这种树-RNN方法比使用更既定的顺序表示的优点是它可以自然捕获方程的结构。我们提出的方法包括将方程式的数学表达转换为表达树。此外,我们通过使用不同的树-LSTM体系结构将该树编码为树-RNN。实验结果表明,我们提出的方法(i)与先前的最新方法相比,精度超过3%,在需要更复杂推理的一部分问题上提高了超过3%的精度点,并且(ii)在如此复杂的问题上优于4%的精度。
Solving arithmetic word problems is a cornerstone task in assessing language understanding and reasoning capabilities in NLP systems. Recent works use automatic extraction and ranking of candidate solution equations providing the answer to arithmetic word problems. In this work, we explore novel approaches to score such candidate solution equations using tree-structured recursive neural network (Tree-RNN) configurations. The advantage of this Tree-RNN approach over using more established sequential representations, is that it can naturally capture the structure of the equations. Our proposed method consists of transforming the mathematical expression of the equation into an expression tree. Further, we encode this tree into a Tree-RNN by using different Tree-LSTM architectures. Experimental results show that our proposed method (i) improves overall performance with more than 3% accuracy points compared to previous state-of-the-art, and with over 15% points on a subset of problems that require more complex reasoning, and (ii) outperforms sequential LSTMs by 4% accuracy points on such more complex problems.