UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Unraveling the Enigma of Double Descent: an in-depth Analysis Through the Lens of Learned Feature Space

Gu, Y; Zheng, X; Aste, T; (2024) Unraveling the Enigma of Double Descent: an in-depth Analysis Through the Lens of Learned Feature Space. In: 12th International Conference on Learning Representations, ICLR 2024. International Conference on Learning Representations (ICLR): Vienna, Austria. Green open access

[thumbnail of 287_Unraveling_the_Enigma_of_D.pdf]
Preview
Text
287_Unraveling_the_Enigma_of_D.pdf - Published Version

Download (996kB) | Preview

Abstract

Double descent presents a counter-intuitive aspect within the machine learning domain, and researchers have observed its manifestation in various models and tasks. While some theoretical explanations have been proposed for this phenomenon in specific contexts, an accepted theory for its occurring mechanism in deep learning remains yet to be established. In this study, we revisit the phenomenon of double descent and demonstrate that the presence of noisy data strongly influences its occurrence. By comprehensively analysing the feature space of learned representations, we unveil that double descent arises in imperfect models trained with noisy data. We argue that while small and intermediate models before the interpolation threshold follow the traditional bias-variance trade-off, over-parameterized models interpolate noisy samples among robust data thus acquiring the capability to separate the information from the noise. The source code is available at https://github.com/Yufei-Gu-451/double_descent_inference.git.

Type: Proceedings paper
Title: Unraveling the Enigma of Double Descent: an in-depth Analysis Through the Lens of Learned Feature Space
Event: 12th International Conference on Learning Representations, ICLR 2024
Open access status: An open access version is available from UCL Discovery
Publisher version: https://openreview.net/forum?id=CEkIyshNbC
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Neural network, double descent, classification, interpretability
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10195825
Downloads since deposit
59Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item