
arXiv:2003.11535 [cs.CV]

Training Binary Neural Networks with Real-to-Binary Convolutions

Brais Martinez, Jing Yang, Adrian Bulat, Georgios Tzimiropoulos

Published 2020-03-25, Version 1

This paper shows how to train binary networks to within a few percentage points ($\sim 3$-$5\%$) of the full precision counterpart. We first show how to build a strong baseline, which already achieves state-of-the-art accuracy, by combining recently proposed advances and carefully adjusting the optimization procedure. Secondly, we show that by attempting to minimize the discrepancy between the output of the binary and the corresponding real-valued convolution, additional significant accuracy gains can be obtained. We materialize this idea in two complementary ways: (1) with a loss function, during training, by matching the spatial attention maps computed at the output of the binary and real-valued convolutions, and (2) in a data-driven manner, by using the real-valued activations, available during inference prior to the binarization process, for re-scaling the activations right after the binary convolution. Finally, we show that, when putting all of our improvements together, the proposed model beats the current state of the art by more than 5% top-1 accuracy on ImageNet and reduces the gap to its real-valued counterpart to less than 3% and 5% top-1 accuracy on CIFAR-100 and ImageNet, respectively, when using a ResNet-18 architecture. Code is available at https://github.com/brais-martinez/real2binary.
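
The two mechanisms named in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation (the repository linked above is the reference for that). The abstract gives no formulas, so the following are assumptions: the spatial attention map is taken to be the channel-wise sum of squared activations, flattened and L2-normalized, and the re-scaling uses a hypothetical gating function (a sigmoid of an affine map of the globally averaged real-valued input). All function and parameter names are illustrative.

```python
import math


def attention_map(features):
    """Spatial attention of a C x H x W feature map (nested lists).

    Assumed definition: sum of squared activations over channels,
    flattened over space and L2-normalized.
    """
    channels = len(features)
    height, width = len(features[0]), len(features[0][0])
    flat = [
        sum(features[c][i][j] ** 2 for c in range(channels))
        for i in range(height)
        for j in range(width)
    ]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


def attention_matching_loss(binary_out, real_out):
    """L2 distance between the attention maps of the two convolution outputs."""
    a_bin = attention_map(binary_out)
    a_real = attention_map(real_out)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a_bin, a_real)))


def channel_scales(real_input, gate_weights, gate_bias):
    """Per-output-channel scales computed from the real-valued input.

    Hypothetical gate: global average pooling per input channel, then an
    affine map (gate_weights is C_out x C_in) followed by a sigmoid.
    """
    pooled = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
        for ch in real_input
    ]
    return [
        1.0 / (1.0 + math.exp(-(sum(w * p for w, p in zip(w_row, pooled)) + b)))
        for w_row, b in zip(gate_weights, gate_bias)
    ]


def rescale(binary_conv_out, scales):
    """Multiply each output channel of the binary convolution by its scale."""
    return [
        [[v * s for v in row] for row in ch]
        for ch, s in zip(binary_conv_out, scales)
    ]
```

In this sketch the loss is zero when the two outputs have the same spatial attention pattern, even if their magnitudes differ, because each map is normalized before comparison. The scales depend only on the real-valued input to the layer, which is available at inference time before binarization, so `rescale` can be applied right after the binary convolution.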

Comments: ICLR 2020

Categories: cs.CV

Keywords: training binary neural networks, real-to-binary convolutions, additional significant accuracy gains, spatial attention maps, real-valued convolution

Related articles:

arXiv:1904.07852 [cs.CV] (Published 2019-04-16)

Matrix and tensor decompositions for training binary neural networks

Adrian Bulat, Jean Kossaifi, Georgios Tzimiropoulos, Maja Pantic

arXiv:2010.04871 [cs.CV] (Published 2020-10-10)

Training Binary Neural Networks through Learning with Noisy Supervision

Kai Han, Yunhe Wang, Yixing Xu, Chunjing Xu, Enhua Wu, Chang Xu

arXiv:1709.01182 [cs.CV] (Published 2017-09-04)

Is human face processing a feature- or pattern-based task? Evidence using a unified computational method driven by eye movements