UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Information Locality as an Inductive Bias for Neural Language Models

Someya, Taiga; Svete, Anej; DuSell, Brian; O’Donnell, Timothy J; Giulianelli, Mario; Cotterell, Ryan; (2025) Information Locality as an Inductive Bias for Neural Language Models. In: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). (pp. pp. 27995-28013). Association for Computational Linguistics Green open access

[thumbnail of 2025.acl-long.1357.pdf]
Preview
Text
2025.acl-long.1357.pdf - Published Version

Download (1MB) | Preview

Abstract

Inductive biases are inherent in every machine learning system, shaping how models generalize from finite data. In the case of neural language models (LMs), debates persist as to whether these biases align with or diverge from human processing constraints. To address this issue, we propose a quantitative framework that allows for controlled investigations into the nature of these biases. Within our framework, we introduce m-local entropy—an informationtheoretic measure derived from average lossycontext surprisal—that captures the local uncertainty of a language by quantifying how effectively the m − 1 preceding symbols disambiguate the next symbol. In experiments on both perturbed natural language corpora and languages defined by probabilistic finite-state automata (PFSAs), we show that languages with higher m-local entropy are more difficult for Transformer and LSTM LMs to learn. These results suggest that neural LMs, much like humans, are highly sensitive to the local statistical structure of a language.

Type: Proceedings paper
Title: Information Locality as an Inductive Bias for Neural Language Models
Event: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Dates: Jul 2025 - Jul 2025
Open access status: An open access version is available from UCL Discovery
DOI: 10.18653/v1/2025.acl-long.1357
Publisher version: https://doi.org/10.18653/v1/2025.acl-long.1357
Language: English
Additional information: ACL materials are Copyright © 1963–2025 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License. Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Linguistics
URI: https://discovery.ucl.ac.uk/id/eprint/10216472
Downloads since deposit
3Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item