UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

EDIN: An End-to-end Benchmark and Pipeline for Unknown Entity Discovery and Indexing

Kassner, N; Petroni, F; Plekhanov, M; Riedel, S; Cancedda, N; (2022) EDIN: An End-to-end Benchmark and Pipeline for Unknown Entity Discovery and Indexing. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022. (pp. pp. 8659-8673). ACL Green open access

[thumbnail of 2022.emnlp-main.593.pdf]
Preview
PDF
2022.emnlp-main.593.pdf - Published Version

Download (528kB) | Preview

Abstract

Existing work on Entity Linking mostly assumes that the reference knowledge base is complete, and therefore all mentions can be linked. In practice this is hardly ever the case, as knowledge bases are incomplete and because novel concepts arise constantly. We introduce the temporally segmented Unknown Entity Discovery and Indexing (EDIN) -benchmark where unknown entities, that is entities not part of the knowledge base and without descriptions and labeled mentions, have to be integrated into an existing entity linking system. By contrasting EDIN with zero-shot entity linking, we provide insight on the additional challenges it poses. Building on dense-retrieval based entity linking, we introduce the end-to-end EDIN-pipeline that detects, clusters, and indexes mentions of unknown entities in context. Experiments show that indexing a single embedding per entity unifying the information of multiple mentions works better than indexing mentions independently.

Type: Proceedings paper
Title: EDIN: An End-to-end Benchmark and Pipeline for Unknown Entity Discovery and Indexing
Event: The 2022 Conference on Empirical Methods in Natural Language Processing
Open access status: An open access version is available from UCL Discovery
Publisher version: https://aclanthology.org/2022.emnlp-main.593
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10166599
Downloads since deposit
11Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item