UCL Discovery

MIMMO: Multi-Input Massive Multi-Output Neural Network

Ferianc, M; Rodrigues, M; (2023) MIMMO: Multi-Input Massive Multi-Output Neural Network. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. (pp. 4564-4569). IEEE: Vancouver, Canada. Green open access

Ferianc_MIMMO_Multi-Input_Massive_Multi-Output_Neural_Network_CVPRW_2023_paper.pdf - Accepted Version

Abstract

Neural networks (NNs) have achieved superhuman accuracy in multiple tasks, but the certainty of NN predictions is often debatable, especially when the NN is confronted with data from outside the training distribution. Averaging the predictions of an ensemble of NNs can recalibrate the certainty of the predictions, but an ensemble is computationally expensive to deploy in practice. Recently, a new hardware-efficient multi-input multi-output (MIMO) NN was proposed to fit an ensemble of independent NNs into a single NN. In this work, we propose adding early exits to the MIMO architecture, with inferred depth-wise weightings, to produce multiple predictions for the same input, giving a more diverse ensemble. We denote this combination as MIMMO: a multi-input, massive multi-output NN. We show that it can achieve better accuracy and calibration than the MIMO NN, simultaneously fit more NNs, and be as hardware efficient as MIMO or the early-exit ensemble.
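As a rough illustration of the idea of combining several exit predictions with depth-wise weightings, the following minimal sketch averages per-exit class probabilities using softmax-normalised weights. It is not the authors' implementation: the abstract does not say how the weightings are inferred or applied, so the function names (`softmax`, `combine_exits`), the data layout, and the choice of a softmax over depth scores are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def combine_exits(exit_logits, depth_logits):
    """Weighted average of per-exit class probabilities.

    exit_logits: one list of class logits per exit (hypothetical layout).
    depth_logits: one unnormalised score per exit; a softmax turns them
    into depth-wise weights that sum to 1 (an assumed weighting scheme).
    """
    weights = softmax(depth_logits)
    probs = [softmax(logits) for logits in exit_logits]
    num_classes = len(probs[0])
    return [
        sum(w * p[c] for w, p in zip(weights, probs))
        for c in range(num_classes)
    ]


# Toy example: 3 exits, 2 classes. Equal depth scores reduce to a plain average.
exit_logits = [[2.0, 0.0], [0.0, 0.0], [0.0, 2.0]]
combined = combine_exits(exit_logits, depth_logits=[0.0, 0.0, 0.0])
```

With equal depth scores this reduces to ordinary ensemble averaging; unequal scores shift the combined prediction toward the more heavily weighted exits.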

Type: Proceedings paper
Title: MIMMO: Multi-Input Massive Multi-Output Neural Network
Event: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Dates: 17 Jun 2023 - 24 Jun 2023
ISBN-13: 9798350302493
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/CVPRW59228.2023.00480
Publisher version: https://doi.org/10.1109/CVPRW59228.2023.00480
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Training, Costs, Artificial neural networks, Computer architecture, Benchmark testing, Transformers, Hardware
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Electronic and Electrical Eng
URI: https://discovery.ucl.ac.uk/id/eprint/10178269