arXiv Analytics

Sign in

arXiv:1902.05690 [cs.LG]AbstractReferencesReviewsResources

AutoQB: AutoML for Network Quantization and Binarization on Mobile Devices

Qian Lou, Lantao Liu, Minje Kim, Lei Jiang

Published 2019-02-15Version 1

In this paper, we propose a hierarchical deep reinforcement learning (DRL)-based AutoML framework, AutoQB, to automatically explore the design space of channel-level network quantization and binarization for hardware-friendly deep learning on mobile devices. Compared to prior DDPG-based quantization techniques, on the various CNN models, AutoQB automatically achieves the same inference accuracy by $\sim79\%$ less computing overhead, or improves the inference accuracy by $\sim2\%$ with the same computing cost.

Related articles: Most relevant | Search more
arXiv:2011.04232 [cs.LG] (Published 2020-11-09)
SplitEasy: A Practical Approach for Training ML models on Mobile Devices in a split second
arXiv:2101.04866 [cs.LG] (Published 2021-01-13)
Towards Energy Efficient Federated Learning over 5G+ Mobile Devices
arXiv:2306.11426 [cs.LG] (Published 2023-06-20)
Exploring the Performance and Efficiency of Transformer Models for NLP on Mobile Devices