arXiv:1902.05690 [cs.LG]

AutoQB: AutoML for Network Quantization and Binarization on Mobile Devices

Qian Lou, Lantao Liu, Minje Kim, Lei Jiang

Published 2019-02-15, Version 1

In this paper, we propose AutoQB, a hierarchical deep reinforcement learning (DRL)-based AutoML framework that automatically explores the design space of channel-level network quantization and binarization for hardware-friendly deep learning on mobile devices. Compared to prior DDPG-based quantization techniques, across various CNN models, AutoQB automatically achieves the same inference accuracy with $\sim79\%$ less computing overhead, or improves the inference accuracy by $\sim2\%$ at the same computing cost.
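
To make "channel-level quantization and binarization" concrete, here is a minimal illustrative sketch, not the AutoQB implementation. It assumes symmetric uniform quantization for multi-bit channels and sign-based binarization with a mean-magnitude scale for 1-bit channels; the abstract specifies neither scheme. The function names (`quantize_channel`, `quantize_per_channel`, `weight_storage_bits`) and the hand-picked bit-widths are hypothetical. In AutoQB the per-channel choices would come from the DRL search.

```python
from typing import List


def quantize_channel(weights: List[float], bits: int) -> List[float]:
    """Quantize one channel's weights to `bits` bits; return dequantized values."""
    if bits < 1:
        raise ValueError("bits must be >= 1")
    if not weights:
        return []
    if bits == 1:
        # Assumed binarization scheme: keep the sign, scale by mean magnitude.
        alpha = sum(abs(w) for w in weights) / len(weights)
        return [alpha if w >= 0 else -alpha for w in weights]
    # Assumed multi-bit scheme: symmetric uniform levels in [-qmax, qmax].
    qmax = 2 ** (bits - 1) - 1
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return [0.0 for _ in weights]
    scale = max_abs / qmax
    out = []
    for w in weights:
        q = max(-qmax, min(qmax, round(w / scale)))
        out.append(q * scale)
    return out


def quantize_per_channel(
    channels: List[List[float]], bits_per_channel: List[int]
) -> List[List[float]]:
    """Apply a separate bit-width to each channel (channel-level granularity)."""
    if len(channels) != len(bits_per_channel):
        raise ValueError("need exactly one bit-width per channel")
    return [quantize_channel(c, b) for c, b in zip(channels, bits_per_channel)]


def weight_storage_bits(
    channels: List[List[float]], bits_per_channel: List[int]
) -> int:
    """Crude cost proxy: total number of bits needed to store the weights."""
    return sum(len(c) * b for c, b in zip(channels, bits_per_channel))


if __name__ == "__main__":
    # Two toy channels; the bit-widths are hand-picked for illustration only.
    channels = [[0.5, -0.25, 0.1, -0.9], [0.02, -0.01, 0.03, 0.0]]
    bits = [4, 1]  # first channel 4-bit, second channel binarized
    print(quantize_per_channel(channels, bits))
    print(weight_storage_bits(channels, bits), "bits vs", 32 * 8, "at float32")
```

The sketch shows only the design space being searched: each channel gets its own bit-width, and lowering a channel to 1 bit trades precision for storage and compute. How AutoQB's hierarchical agent selects those bit-widths is described in the paper itself.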

Comments: 10 pages, 12 figures

Categories: cs.LG, stat.ML

Keywords: mobile devices, binarization, inference accuracy, channel-level network quantization, prior DDPG-based quantization techniques

Related articles:

arXiv:2011.04232 [cs.LG] (Published 2020-11-09)

SplitEasy: A Practical Approach for Training ML models on Mobile Devices in a split second

Kamalesh Palanisamy, Vivek Khimani, Moin Hussain Moti, Dimitris Chatzopoulos

arXiv:2101.04866 [cs.LG] (Published 2021-01-13)

Towards Energy Efficient Federated Learning over 5G+ Mobile Devices

Dian Shi, Liang Li, Rui Chen, Pavana Prakash, Miao Pan, Yuguang Fang

arXiv:2306.11426 [cs.LG] (Published 2023-06-20)

Exploring the Performance and Efficiency of Transformer Models for NLP on Mobile Devices