arXiv:1707.09725 Abstract | arXiv Analytics

arXiv:1707.09725 [cs.CV]

Analysis and Optimization of Convolutional Neural Network Architectures

Published 2017-07-31, Version 1

Convolutional Neural Networks (CNNs) have dominated various computer vision tasks since Alex Krizhevsky showed that they can be trained effectively and reduced the top-5 error from 26.2% to 15.3% on the ImageNet Large Scale Visual Recognition Challenge. Many aspects of CNNs are examined in various publications, but literature on the analysis and construction of neural network architectures is rare. This work is one step toward closing this gap. It provides a comprehensive overview of existing techniques for CNN analysis and topology construction. A novel way to visualize classification errors with confusion matrices was developed, and based on this method, hierarchical classifiers are described and evaluated. Additionally, some results are confirmed and quantified for CIFAR-100, for example the positive impact of smaller batch sizes, averaging ensembles, data augmentation, and test-time transformations on accuracy. Other results, such as the positive impact of learned color transformations on test accuracy, could not be confirmed. Finally, a model was developed which has only one million learned parameters for an input size of 32x32x3 and 100 classes and which beats the state of the art on the benchmark datasets Asirra, GTSRB, HASYv2, and STL-10.
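Two of the techniques named in the abstract, averaging ensembles and confusion matrices, can be illustrated with a minimal sketch. This is not code from the thesis: the function names (`average_ensemble`, `confusion_matrix`, `argmax`) and the toy probabilities are assumptions made for illustration. The sketch builds only a standard confusion matrix, not the novel visualization the thesis describes.

```python
from typing import List, Sequence


def average_ensemble(prob_sets: Sequence[Sequence[Sequence[float]]]) -> List[List[float]]:
    """Average class probabilities over models.

    prob_sets[m][i][c] is the probability that (hypothetical) model m
    assigns to class c for sample i.
    """
    n_models = len(prob_sets)
    n_samples = len(prob_sets[0])
    n_classes = len(prob_sets[0][0])
    return [
        [sum(p[i][c] for p in prob_sets) / n_models for c in range(n_classes)]
        for i in range(n_samples)
    ]


def argmax(row: Sequence[float]) -> int:
    """Index of the largest entry in a row of probabilities."""
    return max(range(len(row)), key=row.__getitem__)


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int], n_classes: int) -> List[List[int]]:
    """cm[t][p] counts samples with true class t that were predicted as p."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


# Toy data (made up): two models, three samples, three classes.
model_a = [[0.6, 0.3, 0.1], [0.2, 0.5, 0.3], [0.1, 0.2, 0.7]]
model_b = [[0.4, 0.5, 0.1], [0.1, 0.3, 0.6], [0.2, 0.2, 0.6]]
y_true = [0, 1, 2]

avg = average_ensemble([model_a, model_b])
y_pred = [argmax(row) for row in avg]
cm = confusion_matrix(y_true, y_pred, n_classes=3)

for row in cm:
    print(row)
```

In this toy case the averaged prediction for the second sample is class 2 rather than the true class 1, so the off-diagonal entry `cm[1][2]` is non-zero. Off-diagonal entries of this kind are what a confusion-matrix analysis inspects to find classes that a classifier tends to mix up.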

Comments: Master's thesis. 73 pages + 24 pages appendix; 39 figures; 33 tables

Categories: cs.CV

Keywords: convolutional neural network architectures, large scale visual recognition challenge, imagenet large scale visual recognition, optimization

Tags: dissertation

Related articles:

arXiv:1409.0575 [cs.CV] (Published 2014-09-01)

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky et al.

arXiv:2102.11506 [cs.CV] (Published 2021-02-23)

Comparative evaluation of CNN architectures for Image Caption Generation

Sulabh Katiyar, Samir Kumar Borgohain

arXiv:2211.03854 [cs.CV] (Published 2022-11-07)

Exploration of Convolutional Neural Network Architectures for Large Region Map Automation

R. M. Tsenov, C. J. Henry, J. L. Storie, C. D. Storie, B. Murray, M. Sokolov
