HELD: Hierarchical entity-label disambiguation in named entity recognition task using deep learning

Neves Oliveira, Bárbara Stéphanie; Fernandes de Oliveira, Andreza; Monteiro de Lira, Vinicius; Linhares Coelho da Silva, Ticiana; Fernandes de Macêdo, José Antônio

doi:10.3233/IDA-205720

HELD: Hierarchical entity-label disambiguation in named entity recognition task using deep learning

Article type: Research Article

Authors: Neves Oliveira, Bárbara Stéphanie^{a; *} | Fernandes de Oliveira, Andreza^a | Monteiro de Lira, Vinicius^b | Linhares Coelho da Silva, Ticiana^a | Fernandes de Macêdo, José Antônio^a

Affiliations: [a] Insight Data Science Lab, Federal University of Ceará, Ceará, Brazil | [b] Institute of Information Science and Technologies, National Research Council, Pisa, Italy

Correspondence: [*] Corresponding author: Bárbara Stéphanie Neves Oliveira, Insight Data Science Lab, Federal University of Ceará, Ceará, Brazil. E-mail: barbaraneves@insightlab.ufc.br.

Abstract: Named Entity Recognition (NER) is a challenging learning task of identifying and classifying entity mentions in texts into predefined categories. In recent years, deep learning (DL) methods empowered by distributed representations, such as word- and character-level embeddings, have been employed in NER systems. However, for information extraction in Police narrative reports, the performance of a DL-based NER approach is limited due to the presence of fine-grained ambiguous entities. For example, given the narrative report “Anna stole Ada’s car”, imagine that we intend to identify the VICTIM and the ROBBER, two sub-labels of PERSON. Traditional NER systems have limited performance in categorizing entity labels arranged in a hierarchical structure. Furthermore, it is unfeasible to obtain information from knowledge bases to give a disambiguated meaning between the entity mentions and the actual labels. This information must be extracted directly from the context dependencies. In this paper, we deal with the Hierarchical Entity-Label Disambiguation problem in Police reports without the use of knowledge bases. To tackle such a problem, we present HELD, an ensemble model that combines two components for NER: a BLSTM-CRF architecture and a NER tool. Experiments conducted on a real Police reports dataset show that HELD significantly outperforms baseline approaches.

Keywords: Fine-grained entity labels, hierarchical entity-label disambiguation using context, named entity recognition, deep learning, police reports domain

DOI: 10.3233/IDA-205720

Journal: Intelligent Data Analysis, vol. 26, no. 3, pp. 637-657, 2022

Published: 18 April 2022

Price: EUR 27.50

North America

IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA

Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com

For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl

Europe

IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands

Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl

For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl

Asia

Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China

Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn

For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl

如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl

Share this:

North America

Europe

Asia