论文标题
多句子问题的查询意图
Querent Intent in Multi-Sentence Questions
论文作者
论文摘要
多句问题(MSQ)是通过关系连接的问题的序列,与独立问题的序列不同,需要作为一个单位回答。遵循修辞学理论(RST),我们认识到,MSQ子部分之间的不同“问题话语关系”反映了不同的说话者的意图,因此引起了不同的答案策略。因此,正确识别这些关系是自动回答MSQ的关键步骤。我们在英语中确定了五种不同类型的MSQ,并定义了五个新颖的关系来形容它们。我们从Stack Exchange中提取超过162,000个MSQ,以实现未来的研究。最后,我们基于表面特征实现了高精度基线分类器。
Multi-sentence questions (MSQs) are sequences of questions connected by relations which, unlike sequences of standalone questions, need to be answered as a unit. Following Rhetorical Structure Theory (RST), we recognise that different "question discourse relations" between the subparts of MSQs reflect different speaker intents, and consequently elicit different answering strategies. Correctly identifying these relations is therefore a crucial step in automatically answering MSQs. We identify five different types of MSQs in English, and define five novel relations to describe them. We extract over 162,000 MSQs from Stack Exchange to enable future research. Finally, we implement a high-precision baseline classifier based on surface features.