UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Classical Art Semantics Information Extraction: CASIE Pilot Project

Vlachidis, A; Tudhope, D; (2013) Classical Art Semantics Information Extraction: CASIE Pilot Project. In: Broughton, V, (ed.) (Proceedings) Knowledge Organization: Pushing the Boundaries, 3rd ISKO UK Biennial Conference, 8-9 July 2013, London, UK. ISKO UK Green open access

[thumbnail of Vlachidis_CASIE_Pilot_Project.pdf]
Preview
Text
Vlachidis_CASIE_Pilot_Project.pdf

Download (480kB) | Preview

Abstract

The paper discusses the application of Natural Language Processing (NLP) techniques in the context of semantic annotation of classical art text via rule-based Information Extraction (IE) techniques combined with ontological and domain vocabulary input. The CASIE (Classical Art Semantics Information Extraction) was a pilot collaborative project between the Hypermedia Research Unit (University of South Wales) and the Beazley Archive (Oxford University), which aims to automatically extract information about cultural objects from classical art scholarly texts and represent this information in terms of the ISO metadata standard for cultural heritage, the International Council of Museum’s CIDOC Conceptual Reference Model (CRM). In total 12 documents (fascicules – high quality catalogues) were processed, originating from the Corpus Vasorum Antiquorum (CVA) collection containing over 350 high quality catalogues of mostly ancient Greek painted pottery, illustrating more than 100,000 vases. The extracted information was expressed in interoperable RDF graphs consistent with the CLAROS project format. The role of CIDOC-CRM is central for enabling semantic interoperability across the range of datasets that contribute to CLAROS. The CASIE pilot enabled a complementary exploitation of terminological and ontological resources via rule-based information extraction techniques, delivering semantic annotation with respect to the CRM in the broader field of digital humanities.

Type: Proceedings paper
Title: Classical Art Semantics Information Extraction: CASIE Pilot Project
Event: Knowledge Organization: Pushing the Boundaries, 3rd ISKO UK Biennial Conference, 8-9 July 2013, London, UK
Open access status: An open access version is available from UCL Discovery
Publisher version: http://www.iskouk.org/content/isko-uk-conference-2...
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Digital Humanities, CIDOC-CRM, Semantic Annotation, Natural Language Processing, Information Extraction
UCL classification: UCL
UCL > Provost and Vice Provost Offices
UCL > Provost and Vice Provost Offices > UCL SLASH
UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities
UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities > Dept of Information Studies
URI: https://discovery.ucl.ac.uk/id/eprint/1556221
Downloads since deposit
39Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item