UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Open Vocabulary Extreme Classification Using Generative Models

Simig, D; Petroni, F; Yanki, P; Popat, K; Du, C; Riedel, S; Yazdani, M; (2022) Open Vocabulary Extreme Classification Using Generative Models. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics. (pp. pp. 1561-1583). ACL Green open access

[thumbnail of 2022.findings-acl.123.pdf]
Preview
PDF
2022.findings-acl.123.pdf - Published Version

Download (1MB) | Preview

Abstract

The extreme multi-label classification (XMC) task aims at tagging content with a subset of labels from an extremely large label set. The label vocabulary is typically defined in advance by domain experts and assumed to capture all necessary tags. However in real world scenarios this label set, although large, is often incomplete and experts frequently need to refine it. To develop systems that simplify this process, we introduce the task of open vocabulary XMC (OXMC): given a piece of content, predict a set of labels, some of which may be outside of the known tag set. Hence, in addition to not having training data for some labels-as is the case in zero-shot classification-models need to invent some labels on-the-fly. We propose GROOV, a fine-tuned seq2seq model for OXMC that generates the set of labels as a flat sequence and is trained using a novel loss independent of predicted label order. We show the efficacy of the approach, experimenting with popular XMC datasets for which GROOV is able to predict meaningful labels outside the given vocabulary while performing on par with state-of-the-art solutions for known labels.

Type: Proceedings paper
Title: Open Vocabulary Extreme Classification Using Generative Models
ISBN-13: 9781955917254
Open access status: An open access version is available from UCL Discovery
Publisher version: https://aclanthology.org/2022.findings-acl.123.pdf
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10166601
Downloads since deposit
Loading...
19Downloads
Download activity - last month
Loading...
Download activity - last 12 months
Loading...
Downloads by country - last 12 months
Loading...

Archive Staff Only

View Item View Item