UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Exact learning dynamics of deep linear networks with prior knowledge

Domine, Clementine C; Braun, Lukas; Fitzgerald, James E; Saxe, Andrew M; (2023) Exact learning dynamics of deep linear networks with prior knowledge. Journal of Statistical Mechanics: Theory and Experiment , 2023 (11) , Article ARTN 114004. 10.1088/1742-5468/ad01b8. Green open access

[thumbnail of Exact learning dynamics of deep linear networks with prior knowledge.pdf]
Preview
Text
Exact learning dynamics of deep linear networks with prior knowledge.pdf - Published Version

Download (3MB) | Preview

Abstract

Learning in deep neural networks is known to depend critically on the knowledge embedded in the initial network weights. However, few theoretical results have precisely linked prior knowledge to learning dynamics. Here we derive exact solutions to the dynamics of learning with rich prior knowledge in deep linear networks by generalising Fukumizu’s matrix Riccati solution (Fukumizu 1998 Gen 1 1E-03). We obtain explicit expressions for the evolving network function, hidden representational similarity, and neural tangent kernel over training for a broad class of initialisations and tasks. The expressions reveal a class of task-independent initialisations that radically alter learning dynamics from slow non-linear dynamics to fast exponential trajectories while converging to a global optimum with identical representational similarity, dissociating learning trajectories from the structure of initial internal representations. We characterise how network weights dynamically align with task structure, rigorously justifying why previous solutions successfully described learning from small initial weights without incorporating their fine-scale structure. Finally, we discuss the implications of these findings for continual learning, reversal learning and learning of structured knowledge. Taken together, our results provide a mathematical toolkit for understanding the impact of prior knowledge on deep learning.

Type: Article
Title: Exact learning dynamics of deep linear networks with prior knowledge
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1088/1742-5468/ad01b8
Publisher version: http://dx.doi.org/10.1088/1742-5468/ad01b8
Language: English
Additional information: Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 license. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
Keywords: Science & Technology, Technology, Physical Sciences, Mechanics, Physics, Mathematical, Physics, deep learning, learning theory, machine learning, CONNECTIONIST MODELS, NEURAL-NETWORKS, SYSTEMS
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Gatsby Computational Neurosci Unit
URI: https://discovery.ucl.ac.uk/id/eprint/10198142
Downloads since deposit
10Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item