arXiv:1709.02802 Abstract | arXiv Analytics

arXiv:1709.02802 [cs.LG]Abstract References Reviews Resources

Towards Proving the Adversarial Robustness of Deep Neural Networks

Guy Katz, Clark Barrett, David L. Dill, Kyle Julian, Mykel J. Kochenderfer

Published 2017-09-08Version 1

Autonomous vehicles are highly complex systems, required to function reliably in a wide variety of situations. Manually crafting software controllers for these vehicles is difficult, but there has been some success in using deep neural networks generated using machine-learning. However, deep neural networks are opaque to human engineers, rendering their correctness very difficult to prove manually; and existing automated techniques, which were not designed to operate on neural networks, fail to scale to large systems. This paper focuses on proving the adversarial robustness of deep neural networks, i.e. proving that small perturbations to a correctly-classified input to the network cannot cause it to be misclassified. We describe some of our recent and ongoing work on verifying the adversarial robustness of networks, and discuss some of the open questions we have encountered and how they might be addressed.

Comments: In Proceedings FVAV 2017, arXiv:1709.02126

Journal: EPTCS 257, 2017, pp. 19-26

DOI: 10.4204/EPTCS.257.3

Categories: cs.LG, cs.CR, cs.LO, stat.ML

Subjects: D.2.4, I.2.2

Keywords: deep neural networks, adversarial robustness, open questions, wide variety, manually crafting software controllers

Tags: journal article

Related articles: Most relevant | Search more

arXiv:1711.09404 [cs.LG] (Published 2017-11-26)

Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing their Input Gradients

Andrew Slavin Ross, Finale Doshi-Velez

arXiv:1905.00180 [cs.LG] (Published 2019-05-01)

Dropping Pixels for Adversarial Robustness

Hossein Hosseini, Sreeram Kannan, Radha Poovendran

arXiv:2005.02540 [cs.LG] (Published 2020-05-06)

Proper measure for adversarial robustness

Hyeongji Kim, Ketil Malde

arXiv Analytics

arXiv:1709.02802 [cs.LG]Abstract References Reviews Resources

Towards Proving the Adversarial Robustness of Deep Neural Networks

Links

Toolbox

arXiv:1709.02802 [cs.LG]AbstractReferencesReviewsResources

Towards Proving the Adversarial Robustness of Deep Neural Networks

Links

Toolbox

arXiv:1709.02802 [cs.LG]Abstract References Reviews Resources