INQUIRE: A Natural World Text-to-Image Retrieval Benchmark

Vendrow, Edward; Pantazis, Omiros; Shepard, Alexander; Brostow, Gabriel; Jones, Kate E; Mac Aodha, Oisin; Beery, Sara; (2024) INQUIRE: A Natural World Text-to-Image Retrieval Benchmark. In: Proceedings of the 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024). NeurIPS Proceedings: Vancouver, Canada. Green open access

Abstract

We introduce INQUIRE, a text-to-image retrieval benchmark designed to challenge multimodal vision-language models on expert-level queries. INQUIRE includes iNaturalist 2024 (iNat24), a new dataset of five million natural world images, along with 250 expert-level retrieval queries. These queries are paired with all relevant images comprehensively labeled within iNat24, comprising 33,000 total matches. Queries span categories such as species identification, context, behavior, and appearance, emphasizing tasks that require nuanced image understanding and domain expertise. Our benchmark evaluates two core retrieval tasks: (1) INQUIRE-Fullrank, a full dataset ranking task, and (2) INQUIRE-Rerank, a reranking task for refining top-100 retrievals. Detailed evaluation of a range of recent multimodal models demonstrates that INQUIRE poses a significant challenge, with the best models failing to achieve an mAP@50 above 50%. In addition, we show that reranking with more powerful multimodal models can enhance retrieval performance, yet there remains a significant margin for improvement. By focusing on scientifically-motivated ecological challenges, INQUIRE aims to bridge the gap between AI capabilities and the needs of real-world scientific inquiry, encouraging the development of retrieval systems that can assist with accelerating ecological and biodiversity research. Our dataset and code are available at https://inquire-benchmark.github.io/.
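
For readers unfamiliar with the mAP@50 figure quoted in the abstract, the following is a minimal sketch of one common way to compute average precision at a cutoff k and average it over queries. It is not taken from the INQUIRE code. The function names are hypothetical, and the normalization by min(k, number of relevant images) is an assumption; the benchmark's own implementation may normalize differently.

```python
from typing import Sequence, Tuple


def average_precision_at_k(relevance: Sequence[int], total_relevant: int, k: int = 50) -> float:
    """AP@k for one query.

    relevance: 0/1 labels of the retrieved images, in ranked order.
    total_relevant: number of relevant images for the query in the whole dataset.
    Assumption: normalize by min(k, total_relevant); other conventions exist.
    """
    if total_relevant <= 0:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, rel in enumerate(relevance[:k], start=1):
        if rel:
            hits += 1
            # Precision at this rank, counted only where a relevant image appears.
            precision_sum += hits / rank
    return precision_sum / min(k, total_relevant)


def mean_average_precision_at_k(runs: Sequence[Tuple[Sequence[int], int]], k: int = 50) -> float:
    """Mean of AP@k over queries; each run is (ranked relevance labels, total relevant)."""
    scores = [average_precision_at_k(rel, total, k) for rel, total in runs]
    return sum(scores) / len(scores) if scores else 0.0
```

Under this convention, a query whose ranked results are labeled [1, 0, 1] with two relevant images in total scores (1/1 + 2/3) / 2 at k = 3. The same function can score a reranking task by passing the labels of the reordered top-100 list.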

Type: Proceedings paper
Title: INQUIRE: A Natural World Text-to-Image Retrieval Benchmark
Event: 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024)
Open access status: An open access version is available from UCL Discovery
Publisher version: https://neurips.cc/virtual/2024/poster/97543
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment
URI: https://discovery.ucl.ac.uk/id/eprint/10202762