UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices

Lane, ND; Bhattacharya, S; Georgiev, P; Forlivesi, C; Jiao, L; Qendro, L; Kawsar, F; (2016) DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices. In: Proceedings of the 2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN). IEEE: Vienna, Austria. Green open access

[thumbnail of deepx_ipsn.pdf]
Preview
Text
deepx_ipsn.pdf - Published Version

Download (486kB) | Preview

Abstract

Breakthroughs from the field of deep learning are radically changing how sensor data are interpreted to extract the high-level information needed by mobile apps. It is critical that the gains in inference accuracy that deep models afford become embedded in future generations of mobile apps. In this work, we present the design and implementation of DeepX, a software accelerator for deep learning execution. DeepX signif- icantly lowers the device resources (viz. memory, computation, energy) required by deep learning that currently act as a severe bottleneck to mobile adoption. The foundation of DeepX is a pair of resource control algorithms, designed for the inference stage of deep learning, that: (1) decompose monolithic deep model network architectures into unit- blocks of various types, that are then more efficiently executed by heterogeneous local device processors (e.g., GPUs, CPUs); and (2), perform principled resource scaling that adjusts the architecture of deep models to shape the overhead each unit-blocks introduces. Experiments show, DeepX can allow even large-scale deep learning models to execute efficently on modern mobile processors and significantly outperform existing solutions, such as cloud-based offloading.

Type: Proceedings paper
Title: DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices
Event: 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN)
ISBN-13: 9781509008025
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/IPSN.2016.7460664
Publisher version: http://dx.doi.org/10.1109/IPSN.2016.7460664
Language: English
Additional information: Copyright © 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Keywords: Computational modeling, Computer architecture, Inference algorithms, Machine learning, Mobile communication, Program processors, Runtime
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/1503670
Downloads since deposit
3,153Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item