Vlachidis, A;
Tudhope, D;
(2013)
Classical Art Semantics Information Extraction: CASIE Pilot Project.
In: Broughton, V, (ed.)
(Proceedings) Knowledge Organization: Pushing the Boundaries, 3rd ISKO UK Biennial Conference, 8-9 July 2013, London, UK.
ISKO UK
Preview |
Text
Vlachidis_CASIE_Pilot_Project.pdf Download (480kB) | Preview |
Abstract
The paper discusses the application of Natural Language Processing (NLP) techniques in the context of semantic annotation of classical art text via rule-based Information Extraction (IE) techniques combined with ontological and domain vocabulary input. The CASIE (Classical Art Semantics Information Extraction) was a pilot collaborative project between the Hypermedia Research Unit (University of South Wales) and the Beazley Archive (Oxford University), which aims to automatically extract information about cultural objects from classical art scholarly texts and represent this information in terms of the ISO metadata standard for cultural heritage, the International Council of Museum’s CIDOC Conceptual Reference Model (CRM). In total 12 documents (fascicules – high quality catalogues) were processed, originating from the Corpus Vasorum Antiquorum (CVA) collection containing over 350 high quality catalogues of mostly ancient Greek painted pottery, illustrating more than 100,000 vases. The extracted information was expressed in interoperable RDF graphs consistent with the CLAROS project format. The role of CIDOC-CRM is central for enabling semantic interoperability across the range of datasets that contribute to CLAROS. The CASIE pilot enabled a complementary exploitation of terminological and ontological resources via rule-based information extraction techniques, delivering semantic annotation with respect to the CRM in the broader field of digital humanities.
Type: | Proceedings paper |
---|---|
Title: | Classical Art Semantics Information Extraction: CASIE Pilot Project |
Event: | Knowledge Organization: Pushing the Boundaries, 3rd ISKO UK Biennial Conference, 8-9 July 2013, London, UK |
Open access status: | An open access version is available from UCL Discovery |
Publisher version: | http://www.iskouk.org/content/isko-uk-conference-2... |
Language: | English |
Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Digital Humanities, CIDOC-CRM, Semantic Annotation, Natural Language Processing, Information Extraction |
UCL classification: | UCL UCL > Provost and Vice Provost Offices UCL > Provost and Vice Provost Offices > UCL SLASH UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities > Dept of Information Studies |
URI: | https://discovery.ucl.ac.uk/id/eprint/1556221 |
Archive Staff Only
View Item |