UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Towards a Similarity-adjusted Surprisal Theory

Meister, Clara; Giulianelli, Mario; Pimentel, Tiago; (2024) Towards a Similarity-adjusted Surprisal Theory. In: Al-Onaizan, Yaser and Bansal, Mohit and Chen, Yun-Nung, (eds.) Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. (pp. pp. 16485-16498). Association for Computational Linguistics: Miami, FL, USA. Green open access

[thumbnail of 2024.emnlp-main.921.pdf]
Preview
PDF
2024.emnlp-main.921.pdf - Published Version

Download (417kB) | Preview

Abstract

Surprisal theory posits that the cognitive effort required to comprehend a word is determined by its contextual predictability, quantified as surprisal. Traditionally, surprisal theory treats words as distinct entities, overlooking any potential similarity between them. Giulianelli et al. (2023) address this limitation by introducing information value, a measure of predictability designed to account for similarities between communicative units. Our work leverages Ricotta and Szeidl’s (2006) diversity index to extend surprisal into a metric that we term similarity-adjusted surprisal, exposing a mathematical relationship between surprisal and information value. Similarity-adjusted surprisal aligns with information value when considering graded similarities and reduces to standard surprisal when words are treated as distinct. Experimental results with reading time data indicate that similarity-adjusted surprisal adds predictive power beyond standard surprisal for certain datasets, suggesting it serves as a complementary measure of comprehension effort.

Type: Proceedings paper
Title: Towards a Similarity-adjusted Surprisal Theory
Event: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Dates: Nov 2024 - Nov 2024
Open access status: An open access version is available from UCL Discovery
DOI: 10.18653/v1/2024.emnlp-main.921
Publisher version: https://doi.org/10.18653/v1/2024.emnlp-main.921
Language: English
Additional information: ACL materials are Copyright © 1963–2025 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License. Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Linguistics
URI: https://discovery.ucl.ac.uk/id/eprint/10216475
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item