论文标题
法律领域的细粒度分类
Fine-grained Intent Classification in the Legal Domain
论文作者
论文摘要
法律从业人员必须经过许多长期的法律案件程序。要了解法律案件中不同政党/个人的行为背后的动机,必须清楚地理解表达与案件相对应的文档的部分。在本文中,我们介绍了一个属于谋杀,土地纠纷,抢劫或腐败的案件类别的93个法律文件的数据集,其中短语表达的意图与文件类别相同。另外,我们注释了每个这样的短语的细粒度意图,以使对读者的情况有更深入的了解。最后,我们分析了几种基于变压器的模型的性能,以使提取意图短语的过程(在粗粒和细粒度级别上),并将文档分类为可能的4个类别之一,并观察到,我们的数据集具有挑战性,尤其是在细粒度的意图分类的情况下。
A law practitioner has to go through a lot of long legal case proceedings. To understand the motivation behind the actions of different parties/individuals in a legal case, it is essential that the parts of the document that express an intent corresponding to the case be clearly understood. In this paper, we introduce a dataset of 93 legal documents, belonging to the case categories of either Murder, Land Dispute, Robbery, or Corruption, where phrases expressing intent same as the category of the document are annotated. Also, we annotate fine-grained intents for each such phrase to enable a deeper understanding of the case for a reader. Finally, we analyze the performance of several transformer-based models in automating the process of extracting intent phrases (both at a coarse and a fine-grained level), and classifying a document into one of the possible 4 categories, and observe that, our dataset is challenging, especially in the case of fine-grained intent classification.