When Representations Align: Universality in Representation Learning Dynamics

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

When Representations Align: Universality in Representation Learning Dynamics

van Rossem, L; Saxe, AM; (2024) When Representations Align: Universality in Representation Learning Dynamics. In: Proceedings of the 41st International Conference on Machine Learning. (pp. pp. 49098-49121). Proceedings of Machine Learning Research: Vienna, Austria. Green open access

[thumbnail of When Representations Align.pdf]

Preview

PDF
When Representations Align.pdf - Accepted Version
Download (4MB) | Preview

Abstract

Deep neural networks come in many sizes and architectures. The choice of architecture, in conjunction with the dataset and learning algorithm, is commonly understood to affect the learned neural representations. Yet, recent results have shown that different architectures learn representations with striking qualitative similarities. Here we derive an effective theory of representation learning under the assumption that the encoding map from input to hidden representation and the decoding map from representation to output are arbitrary smooth functions. This theory schematizes representation learning dynamics in the regime of complex, large architectures, where hidden representations are not strongly constrained by the parametrization. We show through experiments that the effective theory describes aspects of representation learning dynamics across a range of deep networks with different activation functions and architectures, and exhibits phenomena similar to the “rich” and “lazy” regime. While many network behaviors depend quantitatively on architecture, our findings point to certain behaviors that are widely conserved once models are sufficiently flexible.

Type:	Proceedings paper
Title:	When Representations Align: Universality in Representation Learning Dynamics
Event:	41st International Conference on Machine Learning
Open access status:	An open access version is available from UCL Discovery
Publisher version:	https://proceedings.mlr.press/v235/van-rossem24a.h...
Language:	English
Additional information:	This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification:	UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Gatsby Computational Neurosci Unit
URI:	https://discovery.ucl.ac.uk/id/eprint/10198177

Downloads since deposit

11Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item