MIMMO: Multi-Input Massive Multi-Output Neural Network

Ferianc, M; Rodrigues, M; (2023) MIMMO: Multi-Input Massive Multi-Output Neural Network. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. (pp. 4564-4569). IEEE: Vancouver, Canada. Green open access

Abstract

Neural networks (NNs) have achieved superhuman accuracy in multiple tasks, but the certainty of NN predictions is often debatable, especially when the network is confronted with data from outside the training distribution. Averaging the predictions of an ensemble of NNs can recalibrate the certainty of the predictions, but an ensemble is computationally expensive to deploy in practice. Recently, a new hardware-efficient multi-input multi-output (MIMO) NN was proposed to fit an ensemble of independent NNs into a single NN. In this work, we propose adding early exits to the MIMO architecture, with inferred depth-wise weightings, to produce multiple predictions for the same input and so give a more diverse ensemble. We denote this combination as MIMMO: a multi-input, massive multi-output NN. We show that, compared to the MIMO NN, it can achieve better accuracy and calibration and simultaneously fit more NNs, while being as hardware efficient as MIMO or the early-exit ensemble.
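
The abstract does not say how the depth-wise weightings are inferred or how the exit predictions are combined, so the following is only an illustrative sketch of one plausible reading: class probabilities from several early exits are averaged using normalised per-depth weights. The function names, the softmax normalisation of the weights, and all numeric values are assumptions made for illustration, not the authors' method.

```python
import math

def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def combine_exits(exit_logits, depth_scores):
    """Weighted average of per-exit class probabilities.

    exit_logits:  list of D lists, each holding C class logits (one list per exit).
    depth_scores: list of D unnormalised scores (hypothetical); a softmax
                  turns them into depth-wise weights that sum to 1.
    """
    weights = softmax(depth_scores)
    probs = [softmax(logits) for logits in exit_logits]
    n_classes = len(exit_logits[0])
    return [
        sum(w * p[c] for w, p in zip(weights, probs))
        for c in range(n_classes)
    ]

# Made-up logits for one input at three exits, three classes.
exit_logits = [[2.0, 0.5, 0.1], [1.0, 1.2, 0.3], [3.0, 0.2, 0.1]]
# Equal scores reduce the combination to a plain ensemble average.
depth_scores = [0.0, 0.0, 0.0]
combined = combine_exits(exit_logits, depth_scores)
```

Because the weights sum to 1 and each exit outputs a probability vector, `combined` is itself a valid probability distribution over the classes. Raising one exit's score shifts the combined prediction towards that exit.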

Type:	Proceedings paper
Title:	MIMMO: Multi-Input Massive Multi-Output Neural Network
Event:	2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Dates:	17 Jun 2023 - 24 Jun 2023
ISBN-13:	9798350302493
Open access status:	An open access version is available from UCL Discovery
DOI:	10.1109/CVPRW59228.2023.00480
Publisher version:	https://doi.org/10.1109/CVPRW59228.2023.00480
Language:	English
Additional information:	This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords:	Training, Costs, Artificial neural networks, Computer architecture, Benchmark testing, Transformers, Hardware
UCL classification:	UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Electronic and Electrical Eng
URI:	https://discovery.ucl.ac.uk/id/eprint/10178269
