arXiv:1511.06068 [cs.LG]
Reducing Overfitting in Deep Networks by Decorrelating Representations
Michael Cogswell, Faruk Ahmed, Ross Girshick, Larry Zitnick, Dhruv Batra
Published 2015-11-19 (Version 1)
One major challenge in training Deep Neural Networks is preventing overfitting. Many techniques, from data augmentation to novel regularizers such as Dropout, have been proposed to prevent overfitting without requiring a massive amount of training data. In this work, we propose a new regularizer called DeCov which leads to significantly reduced overfitting (as indicated by the gap between training and validation performance) and better generalization. Our regularizer encourages diverse or non-redundant representations in Deep Neural Networks by minimizing the cross-covariance of hidden activations. This simple intuition has been explored in a number of past works but, surprisingly, has never been applied as a regularizer in supervised learning. Experiments across a range of datasets and network architectures show that this loss always reduces overfitting while almost always maintaining or increasing generalization performance, often improving performance over Dropout.
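The abstract only states that the regularizer minimizes the cross-covariance of hidden activations; the sketch below illustrates one way such a decorrelation penalty could look. The framework (PyTorch), the function name, and the exact form of the penalty (squared off-diagonal entries of the batch covariance of one layer's activations) are assumptions for illustration, not details taken from the abstract.

```python
# Hypothetical sketch of a decorrelation penalty on hidden activations
# (framework and exact formulation are assumptions, not from the abstract).
import torch

def decorrelation_penalty(h: torch.Tensor) -> torch.Tensor:
    """Penalize off-diagonal entries of the batch covariance of activations.

    h: (batch_size, num_units) hidden activations from one layer.
    Returns a scalar intended to be added to the task loss with a small weight.
    """
    n = h.size(0)
    centered = h - h.mean(dim=0, keepdim=True)         # subtract per-unit batch mean
    cov = centered.t() @ centered / n                    # (num_units, num_units) batch covariance
    off_diag = cov - torch.diag(torch.diagonal(cov))     # keep only cross-covariances, drop variances
    return 0.5 * off_diag.pow(2).sum()                   # squared Frobenius norm of off-diagonal part
```

In a training loop, a term like `loss = task_loss + weight * decorrelation_penalty(hidden)` (with a small `weight`) would push units of the chosen layer toward non-redundant activations while leaving each unit's own variance unpenalized.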