UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages

Ogundepo, O; Gwadabe, TR; Rivera, CE; Clark, JH; Ruder, S; Adelani, DI; Dossou, BFP; ... Adhiambo, S; + view all (2023) AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages. In: Findings of the Association for Computational Linguistics: EMNLP 2023. (pp. pp. 14957-14972). Association for Computational Linguistics Green open access

[thumbnail of 2023.findings-emnlp.997.pdf]
Preview
Text
2023.findings-emnlp.997.pdf - Published Version

Download (247kB) | Preview

Abstract

African languages have far less in-language content available digitally, making it challenging for question-answering systems to satisfy the information needs of users. Cross-lingual open-retrieval question answering (XOR QA) systems-those that retrieve answer content from other languages while serving people in their native language-offer a means of filling this gap. To this end, we create AFRIQA, the first cross-lingual QA dataset with a focus on African languages. AFRIQA includes 12,000+ XOR QA examples across 10 African languages. While previous datasets have focused primarily on languages where crosslingual QA augments coverage from the target language, AFRIQA focuses on languages where cross-lingual answer content is the only high-coverage source of answer content. Because of this, we argue that African languages are one of the most important and realistic use cases for XOR QA. Our experiments demonstrate the poor performance of automatic translation and multilingual retrieval methods. Overall, AFRIQA proves challenging for state-of-the-art QA models. We hope that the dataset enables the development of more equitable QA technology.

Type: Proceedings paper
Title: AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
ISBN-13: 9798891760615
Open access status: An open access version is available from UCL Discovery
Publisher version: https://aclanthology.org/2023.findings-emnlp.997.p...
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10187539
Downloads since deposit
3Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item