arXiv:2007.04028 Abstract | arXiv Analytics

arXiv:2007.04028 [cs.LG]

How benign is benign overfitting?

Amartya Sanyal, Puneet K Dokania, Varun Kanade, Philip H. S. Torr

Published 2020-07-08, Version 1

We investigate two causes for adversarial vulnerability in deep neural networks: bad data and (poorly) trained models. When trained with SGD, deep neural networks essentially achieve zero training error, even in the presence of label noise, while also exhibiting good generalization on natural test data, something referred to as benign overfitting [2, 10]. However, these models are vulnerable to adversarial attacks. We identify label noise as one of the causes for adversarial vulnerability, and provide theoretical and empirical evidence in support of this. Surprisingly, we find several instances of label noise in datasets such as MNIST and CIFAR, and that robustly trained models incur training error on some of these, i.e. they don't fit the noise. However, removing noisy labels alone does not suffice to achieve adversarial robustness. Standard training procedures bias neural networks towards learning "simple" classification boundaries, which may be less robust than more complex ones. We observe that adversarial training does produce more complex decision boundaries. We conjecture that in part the need for complex decision boundaries arises from sub-optimal representation learning. By means of simple toy examples, we show theoretically how the choice of representation can drastically affect adversarial robustness.
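The link between fitting label noise and adversarial vulnerability can be illustrated with a small sketch. This is not the paper's construction or experiment. It assumes a hypothetical 2D dataset with a linear ground-truth boundary and uses a 1-nearest-neighbour classifier as a stand-in for any model that interpolates its training labels. The names `make_data`, `vulnerable_fraction`, and `run`, and the parameters `noise_rate` and `eps`, are illustrative choices. The sketch counts clean test points that have a training point of the opposite label within distance `eps`; moving the test point onto that training point flips the 1-NN prediction, so the count is a lower bound on adversarial error.

```python
import numpy as np


def make_data(n, noise_rate, rng):
    """Sample n points in [-1, 1]^2; the true label is 1 when x0 > 0.

    Each label is flipped independently with probability noise_rate.
    """
    X = rng.uniform(-1.0, 1.0, size=(n, 2))
    y_true = (X[:, 0] > 0).astype(int)
    flip = rng.random(n) < noise_rate
    return X, np.where(flip, 1 - y_true, y_true)


def pairwise_dist(A, B):
    """Euclidean distances between every row of A and every row of B."""
    return np.linalg.norm(A[:, None, :] - B[None, :, :], axis=2)


def predict_1nn(X_train, y_train, X):
    """1-nearest-neighbour prediction; interpolates the training labels."""
    return y_train[pairwise_dist(X, X_train).argmin(axis=1)]


def vulnerable_fraction(X_train, y_train, X_test, y_test, eps):
    """Fraction of test points with an opposite-label training point within eps.

    Perturbing such a test point onto that training point changes the 1-NN
    prediction, so this is a lower bound on adversarial error at radius eps.
    """
    d = pairwise_dist(X_test, X_train)
    opposite_nearby = (d <= eps) & (y_train[None, :] != y_test[:, None])
    return opposite_nearby.any(axis=1).mean()


def run(noise_rate, eps=0.1, n_train=500, n_test=500, seed=0):
    rng = np.random.default_rng(seed)
    X_train, y_train = make_data(n_train, noise_rate, rng)
    X_test, y_test = make_data(n_test, 0.0, rng)  # test labels are clean
    train_err = (predict_1nn(X_train, y_train, X_train) != y_train).mean()
    vuln = vulnerable_fraction(X_train, y_train, X_test, y_test, eps)
    return train_err, vuln


if __name__ == "__main__":
    for rate in (0.0, 0.2):
        train_err, vuln = run(rate)
        print(f"noise={rate:.1f}  train_err={train_err:.3f}  vulnerable>={vuln:.3f}")
```

In this toy setting the training error is zero at both noise rates, since each training point is its own nearest neighbour. Without label noise, only test points near the true boundary are counted as vulnerable; with noise, mislabeled training points scattered through each class region make many more test points vulnerable.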

Categories: cs.LG, stat.ML

Keywords: benign overfitting, procedures bias neural networks, neural networks essentially achieve, networks essentially achieve zero, models incur training error

Related articles:

arXiv:2302.00257 [cs.LG] (Published 2023-02-01)

Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression

Mo Zhou, Rong Ge

arXiv:2201.11489 [cs.LG] (Published 2022-01-27)

The Implicit Bias of Benign Overfitting

Ohad Shamir

arXiv:2410.07746 [cs.LG] (Published 2024-10-10)

Benign Overfitting in Single-Head Attention

Roey Magen, Shuning Shang, Zhiwei Xu, Spencer Frei, Wei Hu, Gal Vardi
