Paper Title

A Category Theory Approach to Interoperability

Author

Del Gratta, Riccardo

Abstract

In this article, we propose a Category Theory approach to (syntactic) interoperability between linguistic tools. The resulting category consists of textual documents (including any linguistic annotations), NLP tools that analyze texts and add further linguistic information, and format converters. Format converters are needed so that tools can both read and produce different formats, which is the key to interoperability. The idea behind this work is the parallel between NLP pipelines and the concepts of composition and associativity in Category Theory. We show how pipelines of linguistic tools can be modeled within the conceptual framework of Category Theory, and we successfully apply this method to two real-life examples.
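
The parallel drawn in the abstract can be illustrated with a minimal sketch. It is not the paper's formalization: the names `Doc`, `Morphism`, `then`, and `identity`, the format labels `txt`, `tok`, and `conll`, and the toy tokenizer and tagger are all hypothetical, chosen only to show documents as objects, tools and converters as arrows, and a pipeline as an associative composition of arrows.

```python
from dataclasses import dataclass, replace
from typing import Callable, Tuple


@dataclass(frozen=True)
class Doc:
    """Object of the category: a text, its format, and its annotations."""
    text: str
    fmt: str
    annotations: Tuple[tuple, ...] = ()


@dataclass(frozen=True)
class Morphism:
    """Arrow of the category: an NLP tool or a format converter."""
    name: str
    src: str  # format the arrow reads
    dst: str  # format the arrow writes
    fn: Callable[[Doc], Doc]

    def __call__(self, doc: Doc) -> Doc:
        if doc.fmt != self.src:
            raise ValueError(f"{self.name} expects {self.src!r}, got {doc.fmt!r}")
        return self.fn(doc)

    def then(self, other: "Morphism") -> "Morphism":
        """Compose self followed by other; the formats must line up."""
        if self.dst != other.src:
            raise TypeError(f"cannot compose {self.name} with {other.name}")
        return Morphism(f"{self.name};{other.name}", self.src, other.dst,
                        lambda doc: other(self(doc)))


def identity(fmt: str) -> Morphism:
    """Identity arrow on a format: leaves the document unchanged."""
    return Morphism(f"id_{fmt}", fmt, fmt, lambda doc: doc)


# Hypothetical toy tools, for illustration only.
def _tokenize(doc: Doc) -> Doc:
    tokens = tuple(doc.text.split())
    return replace(doc, fmt="tok",
                   annotations=doc.annotations + (("tokens", tokens),))


def _tag(doc: Doc) -> Doc:
    tokens = dict(doc.annotations)["tokens"]
    pos = tuple("NUM" if t.isdigit() else "WORD" for t in tokens)
    return replace(doc, annotations=doc.annotations + (("pos", pos),))


tokenize = Morphism("tokenize", "txt", "tok", _tokenize)
tok_to_conll = Morphism("tok_to_conll", "tok", "conll",
                        lambda doc: replace(doc, fmt="conll"))
tag = Morphism("tag", "conll", "conll", _tag)

doc = Doc("call 911 now", "txt")

# Associativity: both groupings describe the same pipeline.
left = tokenize.then(tok_to_conll).then(tag)
right = tokenize.then(tok_to_conll.then(tag))
assert left(doc) == right(doc)
```

In this sketch the tagger cannot be composed directly with the tokenizer because their formats differ; the converter `tok_to_conll` is what makes the pipeline well defined, mirroring the role the abstract assigns to format converters.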
