UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Learning to Re-Rank with Contextualized Stopwords

Hofstätter, S; Lipani, A; Zlabinger, M; Hanbury, A; (2020) Learning to Re-Rank with Contextualized Stopwords. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management. (pp. pp. 2057-2060). Association for Computing Machinery (ACM) Green open access

[thumbnail of Learning_to_Re_Rank_with_Contextualized_Stopwords.pdf]
Preview
Text
Learning_to_Re_Rank_with_Contextualized_Stopwords.pdf - Accepted Version

Download (1MB) | Preview

Abstract

The use of stopwords has been thoroughly studied in traditional Information Retrieval systems, but remains unexplored in the context of neural models. Neural re-ranking models take the full text of both the query and document into account. Naturally, removing tokens that do not carry relevance information provides us with an opportunity to improve the effectiveness by reducing noise and lower document representation caching-storage requirements. In this work we propose a novel contextualized stopword detection mechanism for neural re-ranking models. This mechanism consists of training a sparse vector in order to filter out document tokens from the ranking decision. This vector is learned end-to-end based on the contextualized document representations, allowing the model to filter terms on a per occurrence basis. This leads to a more explainable model, as it reduces noise. We integrate our component into the state-of-the-art interaction-based TK neural re-ranking model. Our experiments on the MS MARCO passage collection and queries from the TREC 2019 Deep Learning Track show that filtering out traditional stopwords prior to the neural model reduces its effectiveness, while learning to filter out contextualized representations improves it.

Type: Proceedings paper
Title: Learning to Re-Rank with Contextualized Stopwords
Event: The 29th ACM International Conference on Information & Knowledge Management
ISBN-13: 978-1-4503-6859-9
Open access status: An open access version is available from UCL Discovery
DOI: 10.1145/3340531.3412079
Publisher version: https://doi.org/10.1145/3340531.3412079
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Civil, Environ and Geomatic Eng
URI: https://discovery.ucl.ac.uk/id/eprint/10113160
Downloads since deposit
291Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item