arXiv:2105.14835 [cs.LG]

Towards Lower Bounds on the Depth of ReLU Neural Networks

Christoph Hertrich, Amitabh Basu, Marco Di Summa, Martin Skutella

Published 2021-05-31, Version 1

We contribute to a better understanding of the class of functions that is represented by a neural network with ReLU activations and a given architecture. Using techniques from mixed-integer optimization, polyhedral theory, and tropical geometry, we provide a mathematical counterbalance to the universal approximation theorems which suggest that a single hidden layer is sufficient for learning tasks. In particular, we investigate whether the class of exactly representable functions strictly increases by adding more layers (with no restrictions on size). This problem has potential impact on algorithmic and statistical aspects because of the insight it provides into the class of functions represented by neural hypothesis classes. However, to the best of our knowledge, this question has not been investigated in the neural network literature. We also present upper bounds on the sizes of neural networks required to represent functions in these neural hypothesis classes.
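As a rough illustration of what "exactly representable" can mean, the sketch below uses the standard identity max(x, y) = relu(x - y) + relu(y) - relu(-y). It is not taken from the abstract. The function names are hypothetical, and "exact" here holds up to floating-point rounding. The maximum of two numbers is written as a network with one hidden layer of three ReLU units, and nesting that construction gives the maximum of four numbers with two hidden layers. Whether the extra layer is actually necessary in cases like this is the kind of question the abstract raises; the code only shows that it is sufficient.

```python
def relu(t):
    """Rectified linear unit: max(t, 0)."""
    return max(t, 0.0)


def max2_one_hidden_layer(x, y):
    """max(x, y) as a ReLU network with a single hidden layer.

    Hidden layer: three ReLU units applied to linear functions of (x, y).
    Output layer: a linear combination of the hidden units.
    """
    h1 = relu(x - y)   # positive part of x - y
    h2 = relu(y)       # positive part of y
    h3 = relu(-y)      # negative part of y, so h2 - h3 == y
    return h1 + h2 - h3


def max4_two_hidden_layers(a, b, c, d):
    """max(a, b, c, d) by nesting the construction above.

    The inner calls form the first hidden layer (six ReLU units); the outer
    call forms the second hidden layer (three ReLU units) applied to linear
    combinations of the first layer's outputs.
    """
    m_ab = max2_one_hidden_layer(a, b)
    m_cd = max2_one_hidden_layer(c, d)
    return max2_one_hidden_layer(m_ab, m_cd)


if __name__ == "__main__":
    print(max2_one_hidden_layer(3, -2))
    print(max4_two_hidden_layers(1, 4, -2, 3))
```

The identity works because relu(y) - relu(-y) equals y for every real y, so the output is y + relu(x - y), which is x when x >= y and y otherwise.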

Categories: cs.LG, cs.DM, cs.NE, math.CO, stat.ML

Keywords: relu neural networks, lower bounds, representable functions strictly increases, neural hypothesis classes, neural network literature

Related articles:

arXiv:2305.15141 [cs.LG] (Published 2023-05-24)

From Tempered to Benign Overfitting in ReLU Neural Networks

Guy Kornowski, Gilad Yehudai, Ohad Shamir

arXiv:2202.03841 [cs.LG] (Published 2022-02-08)

Width is Less Important than Depth in ReLU Neural Networks

Gal Vardi, Gilad Yehudai, Ohad Shamir

arXiv:1903.02237 [cs.LG] (Published 2019-03-06)

Positively Scale-Invariant Flatness of ReLU Neural Networks

Mingyang Yi, Qi Meng, Wei Chen, Zhi-ming Ma, Tie-Yan Liu