UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Dataset Augmentation in Papyrology with Generative Models: A Study of Synthetic Ancient Greek Character Images

Swindall, Matthew I; Player, Timothy; Keener, Ben; Williams, Alex C; Brusuelas, James H; Nicolardi, Federica; D'Angelo, Marzia; ... Wallin, John F; + view all (2022) Dataset Augmentation in Papyrology with Generative Models: A Study of Synthetic Ancient Greek Character Images. In: De Raedt, Luc, (ed.) Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, Vienna. (pp. pp. 4973-4979). IJCAI: Vienna, Austria. Green open access

[thumbnail of Dataset Augmentation in Papyrology with Generative Models Offprint.pdf]
Preview
PDF
Dataset Augmentation in Papyrology with Generative Models Offprint.pdf - Published Version

Download (2MB) | Preview

Abstract

Character recognition models rely substantially on image datasets that maintain a balance of class samples. However, achieving a balance of classes is particularly challenging for ancient manuscript contexts as character instances may be significantly limited. In this paper, we present findings from a study that assess the efficacy of using synthetically generated character instances to augment an existing dataset of ancient Greek character images for use in machine learning models. We complement our model exploration by engaging professional papyrologists to better understand the practical opportunities afforded by synthetic instances. Our results suggest that synthetic instances improve model performance for limited character classes, and may have unexplored effects on character classes more generally. We also find that trained papyrologists are unable to distinguish between synthetic and non-synthetic images and regard synthetic instances as valuable assets for professional and educational contexts. We conclude by discussing the practical implications of our research.

Type: Proceedings paper
Title: Dataset Augmentation in Papyrology with Generative Models: A Study of Synthetic Ancient Greek Character Images
Event: Thirty-First International Joint Conference on Artificial Intelligence (IJCAI 2022)
Dates: 23 Jul 2022 - 29 Jul 2022
ISBN-13: 978-1-956792-00-3
Open access status: An open access version is available from UCL Discovery
DOI: 10.24963/ijcai.2022/689
Publisher version: https://doi.org/10.24963/ijcai.2022/689
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Text, literature and creative language, Theory and philosophy of arts and creativity in AI systems, Support of human creativity, domains of art or creativity
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL SLASH
UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities
UCL > Provost and Vice Provost Offices > UCL SLASH > Faculty of Arts and Humanities > Dept of Greek and Latin
URI: https://discovery.ucl.ac.uk/id/eprint/10206762
Downloads since deposit
6Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item