arXiv:1907.00274 [cs.CV]

NetTailor: Tuning the Architecture, Not Just the Weights

Published 2019-06-29, Version 1

Real-world applications of object recognition often require the solution of multiple tasks in a single platform. Under the standard paradigm of network fine-tuning, an entirely new CNN is learned per task, and the final network size is independent of task complexity. This is wasteful, since simple tasks require smaller networks than more complex tasks, and limits the number of tasks that can be solved simultaneously. To address these problems, we propose a transfer learning procedure, denoted NetTailor, in which layers of a pre-trained CNN are used as universal blocks that can be combined with small task-specific layers to generate new networks. Besides minimizing classification error, the new network is trained to mimic the internal activations of a strong unconstrained CNN, and minimize its complexity by the combination of 1) a soft-attention mechanism over blocks and 2) complexity regularization constraints. In this way, NetTailor can adapt the network architecture, not just its weights, to the target task. Experiments show that networks adapted to simple tasks, such as character or traffic sign recognition, become significantly smaller than those adapted to hard tasks, such as fine-grained recognition. More importantly, due to the modular nature of the procedure, this reduction in network complexity is achieved without compromise of either parameter sharing across tasks, or classification accuracy.
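The abstract names two ingredients for controlling network size: soft attention over blocks and a complexity regularizer, trained alongside classification error and a term that mimics a teacher network's activations. The following is a minimal sketch of how such terms could be combined, not the authors' implementation. Everything in it is an assumption made for illustration: the function names, the softmax parameterization of attention, the squared-error mimic term, the use of per-block costs as a complexity measure, and the weights `lam_mimic` and `lam_cost`.

```python
import math

def softmax(logits):
    """Turn attention logits into weights that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def mix_blocks(x, blocks, logits):
    """Soft attention over candidate blocks: weighted sum of their outputs.

    `blocks` is a hypothetical list of callables, e.g. a pre-trained
    universal block and a small task-specific layer.
    """
    weights = softmax(logits)
    return sum(w * block(x) for w, block in zip(weights, blocks))

def expected_complexity(logits, costs):
    """Attention-weighted cost of one layer (assumed complexity measure)."""
    weights = softmax(logits)
    return sum(w * c for w, c in zip(weights, costs))

def total_loss(cls_loss, student_acts, teacher_acts,
               layer_logits, layer_costs, lam_mimic=1.0, lam_cost=0.1):
    """Classification loss + teacher-mimic term + complexity penalty.

    The squared-error mimic term and the lambda values are assumptions.
    """
    mimic = sum((s - t) ** 2 for s, t in zip(student_acts, teacher_acts))
    mimic /= len(student_acts)
    cost = sum(expected_complexity(lg, cs)
               for lg, cs in zip(layer_logits, layer_costs))
    return cls_loss + lam_mimic * mimic + lam_cost * cost
```

Under these assumptions, lowering the attention logit of an expensive block reduces the penalty, so a simple task can drive costly blocks toward zero weight while a hard task keeps them.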

Journal: CVF/IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019

Categories: cs.CV, cs.LG

Keywords: architecture, simple tasks, traffic sign recognition, complexity regularization constraints, small task-specific layers

Tags: journal article

Related articles:

arXiv:2202.14020 [cs.CV] (Published 2022-02-28)

State-of-the-Art in the Architecture, Methods and Applications of StyleGAN

Amit H. Bermano et al.

arXiv:2108.06159 [cs.CV] (Published 2021-08-13)

Robustness testing of AI systems: A case study for traffic sign recognition

Christian Berghoff, Pavol Bielik, Matthias Neu, Petar Tsankov, Arndt von Twickel

arXiv:1804.00497 [cs.CV] (Published 2018-03-28, updated 2018-10-03)

MicronNet: A Highly Compact Deep Convolutional Neural Network Architecture for Real-time Embedded Traffic Sign Classification

Alexander Wong, Mohammad Javad Shafiee, Michael St. Jules
