论文标题
嵌套命名实体识别的边界回归模型
A Boundary Regression Model for Nested Named Entity Recognition
论文作者
论文摘要
识别指定实体(NES)通常是作为一个分类问题,可以预测句子中单词或NE候选人的类标签。在浅层结构中,对分类的特征进行加权以支持预测。神经网络的最新发展采用了深层结构,将特征分类为连续表示。这种方法展开了一个充满高阶抽象语义信息的密集空间,其中预测基于分布式特征表示。在本文中,句子中NES的位置表示为连续值。然后,引入了回归操作,以回归句子中NES的边界。基于边界回归,我们设计了一个边界回归模型以支持嵌套的NE识别。这是一个多物理学习框架,同时预测了NE候选者的分类评分,并在句子中完善其空间位置。它具有解决嵌套NES并支持边界回归的优势。通过共享用于预测和定位的参数,该模型可以使更多有效的非线性函数近似器增强模型可区分性。实验证明了嵌套NE识别\ footNote的最新性能{我们实现BR模型的代码可在:\ url {https://github.com/wuyuefei3/br}}中获得。
Recognizing named entities (NEs) is commonly conducted as a classification problem that predicts a class tag for a word or a NE candidate in a sentence. In shallow structures, categorized features are weighted to support the prediction. Recent developments in neural networks have adopted deep structures that map categorized features into continuous representations. This approach unfolds a dense space saturated with high-order abstract semantic information, where the prediction is based on distributed feature representations. In this paper, positions of NEs in a sentence are represented as continuous values. Then, a regression operation is introduced to regress boundaries of NEs in a sentence. Based on boundary regression, we design a boundary regression model to support nested NE recognition. It is a multiobjective learning framework, which simultaneously predicts the classification score of a NE candidate and refine its spatial location in a sentence. It has the advantage to resolve nested NEs and support boundary regression for locating NEs in a sntence. By sharing parameters for predicting and locating, this model enables more potent nonlinear function approximators to enhance model discriminability. Experiments demonstrate state-of-the-art performance for nested NE recognition\footnote{Our codes to implement the BR model are available at: \url{https://github.com/wuyuefei3/BR}.}.