Multi-objective autotuning of mobile nets across the full software/hardware stack

A Lokhmotov; Flavio Vella; N Chunosov; G Fursin

doi:10.1145/3229762.3229767

Conference proceeding

Open access

Peer reviewed

Proceedings of the 1st on Reproducible Quality-Efficient Systems Tournament on Co-designing Pareto-efficient Deep Learning

1st ACM ReQuEST Workshop/Tournament on Reproducible Software/Hardware Co-Design of Pareto-Efficient Deep Learning, ReQuEST 2018 (Williamsburg, 24/04/2018 - 24/04/2018)

2018

DOI: https://doi.org/10.1145/3229762.3229767

Handle: https://hdl.handle.net/10863/24407

Keywords: System co-design; Reproducible experimentation; Collective Knowledge; Autotuning; Customizable workflows; Crowdtuning; MobileNets; Live scoreboard; Accuracy; Performance

Abstract

We present a customizable Collective Knowledge workflow to study the execution time vs. accuracy trade-offs for the MobileNets CNN family. We use this workflow to evaluate MobileNets on Arm Cortex CPUs using TensorFlow and Arm Mali GPUs using several versions of the Arm Compute Library. Our optimizations for the Arm Bifrost GPU architecture reduce the execution time by 2-3 times, while lying on a Pareto-optimal frontier. We also highlight the challenge of maintaining the accuracy when deploying CNN models across diverse platforms. We make all the workflow components (models, programs, scripts, etc.) publicly available to encourage further exploration by the community.
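The abstract refers to configurations "lying on a Pareto-optimal frontier" of execution time versus accuracy. As a minimal sketch of what that means, and not the authors' workflow, the Python below filters a list of measurements down to the non-dominated ones. The function name `pareto_frontier`, the configuration labels, and all numbers are hypothetical and chosen only for illustration; they are not results from the paper.

```python
from typing import List, Tuple

# (label, execution time in ms, top-1 accuracy); all values are hypothetical.
Point = Tuple[str, float, float]


def pareto_frontier(points: List[Point]) -> List[Point]:
    """Return the points that no other point dominates.

    One point dominates another if it is no slower and no less accurate,
    and strictly better in at least one of the two objectives.
    """
    # Sort by time ascending; among equal times, put higher accuracy first.
    ordered = sorted(points, key=lambda p: (p[1], -p[2]))
    frontier: List[Point] = []
    best_accuracy = float("-inf")
    for point in ordered:
        # A point survives only if it beats every faster (or equally fast)
        # point on accuracy.
        if point[2] > best_accuracy:
            frontier.append(point)
            best_accuracy = point[2]
    return frontier


# Illustrative measurements only (not taken from the paper).
samples: List[Point] = [
    ("config-a", 10.0, 0.50),
    ("config-b", 20.0, 0.60),
    ("config-c", 25.0, 0.55),  # slower and less accurate than config-b
    ("config-d", 40.0, 0.70),
    ("config-e", 40.0, 0.65),  # same time as config-d, lower accuracy
]

if __name__ == "__main__":
    for label, time_ms, accuracy in pareto_frontier(samples):
        print(f"{label}: {time_ms} ms, accuracy {accuracy}")
```

With these sample values, `config-c` and `config-e` are dropped because a faster or equally fast configuration is at least as accurate. The sort-then-scan approach is one simple choice for two objectives; more objectives would need a general dominance check.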

Files and links (2)

pdf: 3229762.3229767 (2.43 MB), Open Access

url: https://dl.acm.org/doi/abs/10.1145/3229762.3229767

Details

Title: Multi-objective autotuning of mobile nets across the full software/hardware stack
Creators: A Lokhmotov - dividiti, UK
Flavio Vella - dividiti, UK
N Chunosov - Xored, Russia; dividiti, UK
G Fursin - dividiti, UK; cTuning foundation, France
Publication Details: Proceedings of the 1st on Reproducible Quality-Efficient Systems Tournament on Co-designing Pareto-efficient Deep Learning
ISBN: 9781450359238
Conference: 1st ACM ReQuEST Workshop/Tournament on Reproducible Software/Hardware Co-Design of Pareto-Efficient Deep Learning, ReQuEST 2018 (Williamsburg, 24/04/2018 - 24/04/2018)
Publisher: ACM, New York, NY
Format: Online
Number of pages: 10
Identifiers: 978-1-4503-5923-8
(UNIBZ)29973020
991006414398001241
Web of Science ID: 000491870400005
Scopus ID: 2-s2.0-85050640500
Academic Unit: Faculty of Computer Science
Language: English
Resource Type: Conference proceeding
Author Names String: Lokhmotov A, Vella F, Chunosov N, Fursin G
