arXiv:1809.10463 Abstract | arXiv Analytics

arXiv:1809.10463 [cs.LG]Abstract References Reviews Resources

Learning to Train a Binary Neural Network

Joseph Bethge, Haojin Yang, Christian Bartz, Christoph Meinel

Published 2018-09-27Version 1

Convolutional neural networks have achieved astonishing results in different application areas. Various methods which allow us to use these models on mobile and embedded devices have been proposed. Especially binary neural networks seem to be a promising approach for these devices with low computational power. However, understanding binary neural networks and training accurate models for practical applications remains a challenge. In our work, we focus on increasing our understanding of the training process and making it accessible to everyone. We publish our code and models based on BMXNet for everyone to use. Within this framework, we systematically evaluated different network architectures and hyperparameters to provide useful insights on how to train a binary neural network. Further, we present how we improved accuracy by increasing the number of connections in the network.

Comments: Code: https://github.com/Jopyth/BMXNet

Categories: cs.LG, cs.CV, stat.ML

Keywords: convolutional neural networks, understanding binary neural networks, low computational power, application areas, training accurate models

Tags: github project

Related articles: Most relevant | Search more

arXiv:1511.06067 [cs.LG] (Published 2015-11-19)

Convolutional neural networks with low-rank regularization

Cheng Tai, Tong Xiao, Xiaogang Wang, Weinan E

arXiv:1905.04967 [cs.LG] (Published 2019-05-13)

Implicit Filter Sparsification In Convolutional Neural Networks

Dushyant Mehta, Kwang In Kim, Christian Theobalt

arXiv:1802.08250 [cs.LG] (Published 2018-02-22)