arXiv:1904.04334 [cs.LG]
A Target-Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning
Published 2019-04-08 (Version 1)
Due to the lack of sufficient training data and the high computational cost of training a deep neural network from scratch, transfer learning has been used extensively in many deep-neural-network-based applications, such as face recognition, image classification, and speech recognition. A commonly used transfer learning approach takes part of a pre-trained model, adds a few layers at the end, and re-trains the new layers on a small dataset. This approach, while efficient and widely used, introduces a security vulnerability because the pre-trained models used in transfer learning are usually publicly available to everyone, including potential attackers. In this paper, we show that, with no knowledge beyond the pre-trained model, an attacker can launch an effective and efficient brute-force attack that crafts inputs that trigger each target class with high confidence. Note that we assume the attacker has no access to any target-specific information, including samples from the target classes, the re-trained model, or the probabilities assigned by the softmax layer to each class; we therefore call it a target-agnostic attack. To the best of our knowledge, these assumptions render all previous attacks impractical. To evaluate the proposed attack, we perform a set of experiments on face recognition and speech recognition tasks and demonstrate the effectiveness of the attack. Our work sheds light on a fundamental security challenge of transfer learning in deep neural networks.
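To make the threat model concrete, below is a minimal PyTorch-style sketch of the setup the abstract describes: a publicly available pre-trained backbone is frozen, a small classification head is added and re-trained by the victim, and the attacker sees only the public backbone. The `craft_probe` routine is purely illustrative and hypothetical; it shows one plausible way an attacker might gradient-ascend an input so that a single feature-layer neuron dominates, brute-forcing over neurons, and is not necessarily the exact algorithm proposed in the paper. Model choice (ResNet-18), the number of victim classes, and all hyperparameters here are assumptions for the sake of the example.

```python
import torch
import torch.nn as nn
from torchvision import models

# Transfer-learning recipe from the abstract: keep a public pre-trained
# feature extractor, add a few new layers, and re-train only those layers.
backbone = models.resnet18(weights="IMAGENET1K_V1")
feature_dim = backbone.fc.in_features
backbone.fc = nn.Identity()            # expose penultimate features
backbone.eval()
for p in backbone.parameters():
    p.requires_grad = False            # frozen, publicly known part

num_private_classes = 10               # hypothetical size of the victim's task
head = nn.Sequential(                  # the only part the victim re-trains
    nn.Linear(feature_dim, 128), nn.ReLU(),
    nn.Linear(128, num_private_classes),
)

# Hypothetical attacker-side sketch (not the paper's exact method): craft an
# input whose features are dominated by one neuron of the frozen backbone;
# trying this for many neurons tends to produce inputs that each push the
# unseen classification head toward some class with high confidence.
def craft_probe(neuron_idx, steps=200, lr=0.05):
    x = torch.rand(1, 3, 224, 224, requires_grad=True)
    opt = torch.optim.Adam([x], lr=lr)
    for _ in range(steps):
        feats = backbone(x)
        # boost the chosen neuron while keeping other activations small
        loss = -feats[0, neuron_idx] + feats.abs().mean()
        opt.zero_grad()
        loss.backward()
        opt.step()
        x.data.clamp_(0, 1)            # keep the probe a valid image
    return x.detach()
```

The key point the sketch illustrates is that everything the attacker needs (the frozen backbone and its feature layer) is public, while the re-trained head and the victim's data never have to be observed.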