UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Self-Supervised Learning with Kernel Dependence Maximization

Li, Y; Pogodin, R; Sutherland, DJ; Gretton, A; (2021) Self-Supervised Learning with Kernel Dependence Maximization. In: Advances in Neural Information Processing Systems 34 (NeurIPS 2021). (pp. pp. 15543-15556). NeurIPS Green open access

[thumbnail of self_supervised_learning_with_.pdf]
Preview
Text
self_supervised_learning_with_.pdf - Published Version

Download (2MB) | Preview

Abstract

We approach self-supervised learning of image representations from a statistical dependence perspective, proposing Self-Supervised Learning with the Hilbert- Schmidt Independence Criterion (SSL-HSIC). SSL-HSIC maximizes dependence between representations of transformations of an image and the image identity, while minimizing the kernelized variance of those representations. This framework yields a new understanding of InfoNCE, a variational lower bound on the mutual information (MI) between different transformations. While the MI itself is known to have pathologies which can result in learning meaningless representations, its bound is much better behaved: we show that it implicitly approximates SSLHSIC (with a slightly different regularizer). Our approach also gives us insight into BYOL, a negative-free SSL method, since SSL-HSIC similarly learns local neighborhoods of samples. SSL-HSIC allows us to directly optimize statistical dependence in time linear in the batch size, without restrictive data assumptions or indirect mutual information estimators. Trained with or without a target network, SSL-HSIC matches the current state-of-the-art for standard linear evaluation on ImageNet [1], semi-supervised learning and transfer to other classification and vision tasks such as semantic segmentation, depth estimation and object recognition. Code is available at https://github.com/deepmind/ssl-hsic.

Type: Proceedings paper
Title: Self-Supervised Learning with Kernel Dependence Maximization
Event: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)
Open access status: An open access version is available from UCL Discovery
Publisher version: https://proceedings.neurips.cc/paper/2021/hash/830...
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: stat.ML, stat.ML, cs.AI, cs.CV, cs.LG
UCL classification: UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Gatsby Computational Neurosci Unit
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL
URI: https://discovery.ucl.ac.uk/id/eprint/10150718
Downloads since deposit
11Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item