UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

A Cookbook for Community-driven Data Collection of Impaired Speech in Low-Resource Languages

Salihs, SA; Wiafe, I; Abdulai, JD; Atsakpo, ED; Ayoka, G; Cave, R; Ekpezu, AO; ... Winful, FBP; + view all (2025) A Cookbook for Community-driven Data Collection of Impaired Speech in Low-Resource Languages. In: Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. (pp. pp. 4623-4627). ISCA Green open access

[thumbnail of salihs25_interspeech.pdf]
Preview
Text
salihs25_interspeech.pdf - Published Version

Download (317kB) | Preview

Abstract

This study presents an approach for collecting speech samples to build Automatic Speech Recognition (ASR) models for impaired speech, particularly, low-resource languages. It aims to democratize ASR technology and data collection by developing a "cookbook" of best practices and training for community-driven data collection and ASR model building. As a proof-of-concept, this study curated the first open-source dataset of impaired speech in Akan: a widely spoken indigenous language in Ghana. The study involved participants from diverse backgrounds with speech impairments. The resulting dataset, along with the cookbook and open-source tools, are publicly available to enable researchers and practitioners to create inclusive ASR technologies tailored to the unique needs of speech impaired individuals. In addition, this study presents the initial results of finetuning open-source ASR models to better recognize impaired speech in Akan.

Type: Proceedings paper
Title: A Cookbook for Community-driven Data Collection of Impaired Speech in Low-Resource Languages
Event: Interspeech 2025
Open access status: An open access version is available from UCL Discovery
DOI: 10.21437/Interspeech.2025-2261
Publisher version: https://doi.org/10.21437/interspeech.2025-2261
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: automatic speech recognition, impaired speech, low resource language, community engagement, democratizing AI
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10217367
Downloads since deposit
7Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item