UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

On The Specialization of Neural Modules

Jarvis, Devon; Klein, Richard; Rosman, Benjamin; Saxe, Andrew; (2023) On The Specialization of Neural Modules. In: Proceedings of the Eleventh International Conference on Learning Representations. (pp. pp. 1-31). ICLR (In press). Green open access

[thumbnail of 5201_on_the_specialization_of_neura.pdf]
Preview
PDF
5201_on_the_specialization_of_neura.pdf - Accepted Version

Download (1MB) | Preview

Abstract

A number of machine learning models have been proposed with the goal of achieving systematic generalization: the ability to reason about new situations by combining aspects of previous experiences. These models leverage compositional architectures which aim to learn specialized modules dedicated to structures in a task that can be composed to solve novel problems with similar structures. While the compositionality of these architectures is guaranteed by design, the modules specializing is not. Here we theoretically study the ability of network modules to specialize to useful structures in a dataset and achieve systematic generalization. To this end we introduce a minimal space of datasets motivated by practical systematic generalization benchmarks. From this space of datasets we present a mathematical definition of systematicity and study the learning dynamics of linear neural modules when solving components of the task. Our results shed light on the difficulty of module specialization, what is required for modules to successfully specialize, and the necessity of modular architectures to achieve systematicity. Finally, we confirm that the theoretical results in our tractable setting generalize to more complex datasets and non-linear architectures.

Type: Proceedings paper
Title: On The Specialization of Neural Modules
Event: The Eleventh International Conference on Learning Representations
Location: Kigali, Rwanda
Dates: 1 May 2023 - 5 May 2023
Open access status: An open access version is available from UCL Discovery
Publisher version: https://iclr.cc/
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Gatsby Computational Neurosci Unit
URI: https://discovery.ucl.ac.uk/id/eprint/10169381
Downloads since deposit
92Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item