Lane, ND;
Bhattacharya, S;
Georgiev, P;
Forlivesi, C;
Jiao, L;
Qendro, L;
Kawsar, F;
(2016)
DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices.
In:
Proceedings of the 2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).
IEEE: Vienna, Austria.
Preview |
Text
deepx_ipsn.pdf - Published Version Download (486kB) | Preview |
Abstract
Breakthroughs from the field of deep learning are radically changing how sensor data are interpreted to extract the high-level information needed by mobile apps. It is critical that the gains in inference accuracy that deep models afford become embedded in future generations of mobile apps. In this work, we present the design and implementation of DeepX, a software accelerator for deep learning execution. DeepX signif- icantly lowers the device resources (viz. memory, computation, energy) required by deep learning that currently act as a severe bottleneck to mobile adoption. The foundation of DeepX is a pair of resource control algorithms, designed for the inference stage of deep learning, that: (1) decompose monolithic deep model network architectures into unit- blocks of various types, that are then more efficiently executed by heterogeneous local device processors (e.g., GPUs, CPUs); and (2), perform principled resource scaling that adjusts the architecture of deep models to shape the overhead each unit-blocks introduces. Experiments show, DeepX can allow even large-scale deep learning models to execute efficently on modern mobile processors and significantly outperform existing solutions, such as cloud-based offloading.
Type: | Proceedings paper |
---|---|
Title: | DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices |
Event: | 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN) |
ISBN-13: | 9781509008025 |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1109/IPSN.2016.7460664 |
Publisher version: | http://dx.doi.org/10.1109/IPSN.2016.7460664 |
Language: | English |
Additional information: | Copyright © 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Keywords: | Computational modeling, Computer architecture, Inference algorithms, Machine learning, Mobile communication, Program processors, Runtime |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery.ucl.ac.uk/id/eprint/1503670 |
Archive Staff Only
View Item |