Knowledge-Based Named Entity Recognition of Archaeological Concepts in Dutch

Vlachidis, A; Tudhope, D; Wansleeben, M; (2021) Knowledge-Based Named Entity Recognition of Archaeological Concepts in Dutch. In: Garoufallou, E and Ovalle-Perandones, M-A, (eds.) Metadata and Semantic Research. MTSR 2020. Communications in Computer and Information Science. Springer Verlag: Cham, Switzerland. Green open access

Abstract

Advances in Natural Language Processing (NLP) allow the extraction of information from large volumes of text to be automated, making text-based resources more discoverable and useful. This paper turns to one of the most important, but traditionally difficult to access, resources in archaeology: the largely unpublished reports generated by commercial or “rescue” archaeology, commonly known as “grey literature”. It presents the development and evaluation of a Named Entity Recognition system for Dutch archaeological grey literature, targeted at extracting mentions of artefacts, archaeological features, materials, places and time entities. The role of domain vocabulary in developing a pipeline driven by Knowledge Organization Systems (KOS) is discussed, and the pipeline is evaluated against a human-annotated Gold Standard corpus.

Type: Proceedings paper
Title: Knowledge-Based Named Entity Recognition of Archaeological Concepts in Dutch
Event: Metadata and Semantic Research MTSR 2020
Location: Madrid, Spain
Dates: 30 November 2020 - 04 December 2020
ISBN-13: 978-3-030-71902-9
Open access status: An open access version is available from UCL Discovery
DOI: 10.1007/978-3-030-71903-6_6
Publisher version: http://dx.doi.org/10.1007/978-3-030-71903-6_6
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Named Entity Recognition, Archaeology, Grey literature, CIDOC-CRM, Knowledge Organization Systems
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL SLASH
UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities
UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities > Dept of Information Studies
URI: https://discovery.ucl.ac.uk/id/eprint/10125868