arXiv:1906.00150 [cs.LG]
Sparsity Normalization: Stabilizing the Expected Outputs of Deep Networks
Joonyoung Yi, Juhyuk Lee, Sung Ju Hwang, Eunho Yang
Published 2019-06-01, Version 1
The learning of deep models, in which numerous parameters are superimposed, is known to be a fairly sensitive process that must be stabilized through a careful combination of techniques. We introduce an additional challenge that has never been explicitly studied: the heterogeneity of sparsity at the instance level, caused by missing values or by the innate nature of the input distribution. We confirm experimentally on widely used benchmark datasets that this variable sparsity problem destabilizes the output statistics of neurons and hampers learning by saturating non-linearities. We also provide an analysis of this phenomenon, and based on this analysis we present a simple technique to prevent the issue, referred to as Sparsity Normalization (SN). Finally, we show that SN significantly improves performance on several popular benchmark datasets, or achieves comparable performance with lower model capacity. Focusing in particular on the collaborative filtering problem, where the variable sparsity issue has been completely ignored, we achieve new state-of-the-art results on the MovieLens 100k and 1M datasets simply by applying Sparsity Normalization (SN).
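The abstract does not spell out the normalization rule, but the core idea is to rescale each zero-imputed input so that its expected magnitude no longer depends on how many of its features happen to be observed. The sketch below is a minimal illustration of that idea, assuming a per-instance scaling by a constant divided by the number of observed features; the constant `k` and the exact form used in the paper are assumptions here, not the authors' stated formula.

```python
import numpy as np

def sparsity_normalize(x_imputed, observed_mask, k=None):
    """Illustrative sketch of Sparsity Normalization (SN).

    Rescales each zero-imputed input vector so that instances with very
    different numbers of observed features feed comparably-sized signals
    into the network, rather than saturating or starving its non-linearities.
    The constant `k` (here defaulting to the feature dimension) is an
    assumed choice; the paper may use a different scaling constant.
    """
    if k is None:
        k = x_imputed.shape[-1]
    # Per-instance count of observed (non-missing) features.
    n_observed = observed_mask.sum(axis=-1, keepdims=True)
    # Guard against fully-missing rows to avoid division by zero.
    n_observed = np.maximum(n_observed, 1.0)
    return x_imputed * (k / n_observed)

# Usage: two instances with very different sparsity levels.
x = np.array([[1.0, 0.0, 0.0, 0.0],    # mostly missing (zero-imputed)
              [1.0, 2.0, 3.0, 4.0]])   # fully observed
mask = (x != 0).astype(float)
print(sparsity_normalize(x, mask))
```

In this toy example the sparse first row is scaled up by a factor of four, so both instances contribute inputs of comparable total magnitude; without such a correction, the sum of inputs to each neuron would vary with the instance-level sparsity, which is the instability the paper targets.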