UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

OSLNet: Deep Small-Sample Classification with an Orthogonal Softmax Layer

Li, X; Li, X; Chang, D; Ma, Z; Tan, Z-H; Xue, J-H; Cao, J; ... Guo, J; + view all (2020) OSLNet: Deep Small-Sample Classification with an Orthogonal Softmax Layer. IEEE Transactions on Image Processing 10.1109/tip.2020.2990277. (In press). Green open access

[thumbnail of TIP-XiaoxuLi-OSL-R3-UCL.pdf]
Preview
Text
TIP-XiaoxuLi-OSL-R3-UCL.pdf - Accepted Version

Download (9MB) | Preview

Abstract

A deep neural network of multiple nonlinear layers forms a large function space, which can easily lead to overfitting when it encounters small-sample data. To mitigate overfitting in small-sample classification, learning more discriminative features from small-sample data is becoming a new trend. To this end, this paper aims to find a subspace of neural networks that can facilitate a large decision margin. Specifically, we propose the Orthogonal Softmax Layer (OSL), which makes the weight vectors in the classification layer remain orthogonal during both the training and test processes. The Rademacher complexity of a network using the OSL is only 1/K, where K is the number of classes, of that of a network using the fully connected classification layer, leading to a tighter generalization error bound. Experimental results demonstrate that the proposed OSL has better performance than the methods used for comparison on four small-sample benchmark datasets, as well as its applicability to large-sample datasets. Codes are available at: https://github.com/dongliangchang/OSLNet.

Type: Article
Title: OSLNet: Deep Small-Sample Classification with an Orthogonal Softmax Layer
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/tip.2020.2990277
Publisher version: https://doi.org/10.1109/tip.2020.2990277
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Deep neural network, Orthogonal softmax layer, Overfitting, Small-sample classification
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery.ucl.ac.uk/id/eprint/10096964
Downloads since deposit
104Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item