UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Subsequence Based Deep Active Learning for Named Entity Recognition

Radmard, P; Fathullah, Y; Lipani, A; (2021) Subsequence Based Deep Active Learning for Named Entity Recognition. In: Zong, C and Xia, F and Li, W and Navigli, R, (eds.) Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. (pp. pp. 4310-4321). Association for Computational Linguistics Green open access

[thumbnail of Lipani_2021.acl-long.332.pdf]
Preview
Text
Lipani_2021.acl-long.332.pdf - Published Version

Download (677kB) | Preview

Abstract

Active Learning (AL) has been successfully applied to Deep Learning in order to drastically reduce the amount of data required to achieve high performance. Previous works have shown that lightweight architectures for Named Entity Recognition (NER) can achieve optimal performance with only 25% of the original training data. However, these methods do not exploit the sequential nature of language and the heterogeneity of uncertainty within each instance, requiring the labelling of whole sentences. Additionally, this standard method requires that the annotator has access to the full sentence when labelling. In this work, we overcome these limitations by allowing the AL algorithm to query subsequences within sentences, and propagate their labels to other sentences. We achieve highly efficient results on OntoNotes 5.0, only requiring 13% of the original training data, and CoNLL 2003, requiring only 27%. This is an improvement of 39% and 37% compared to querying full sentences.

Type: Proceedings paper
Title: Subsequence Based Deep Active Learning for Named Entity Recognition
Event: 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing
ISBN-13: 978-1-954085-52-7
Open access status: An open access version is available from UCL Discovery
Publisher version: https://aclanthology.org/2021.acl-long.332/
Language: English
Additional information: ©2021 Association for Computational Linguistics, licensed on a Creative Commons Attribution 4.0 International License.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Civil, Environ and Geomatic Eng
URI: https://discovery.ucl.ac.uk/id/eprint/10133183
Downloads since deposit
273Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item