
arXiv:2211.13644 [cs.CV]

Seeds Don't Lie: An Adaptive Watermarking Framework for Computer Vision Models

Jacob Shams, Ben Nassi, Ikuya Morikawa, Toshiya Shimizu, Asaf Shabtai, Yuval Elovici

Published 2022-11-24, Version 1

In recent years, various watermarking methods have been proposed to detect computer vision models obtained illegitimately from their owners; however, they fail to demonstrate satisfactory robustness against model extraction attacks. In this paper, we present an adaptive framework for watermarking a protected model that leverages the unique behavior induced in the model by the random seed used to initialize its training. This watermark is used to detect extracted models, which exhibit the same unique behavior, indicating unauthorized use of the protected model's intellectual property (IP). First, we show that the initial seed for random number generation during model training produces distinct characteristics in the model's decision boundaries; these characteristics are inherited by extracted models but are absent from non-extracted models trained on the same dataset with a different seed. Based on our findings, we propose the Robust Adaptive Watermarking (RAW) Framework, which uses the unique behavior present in the protected and extracted models to generate a watermark key-set and a verification model. We show that the framework is robust to (1) unseen model extraction attacks and (2) extracted models that undergo a blurring method (e.g., weight pruning). We evaluate the framework's robustness against a naive attacker (unaware that the model is watermarked) and an informed attacker (who employs blurring strategies to remove watermarked behavior from an extracted model), achieving AUC values above 0.9. Finally, we show that the framework is robust to model extraction attacks in which the extracted model has a different structure and/or architecture than the protected model.
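The evaluation idea in the abstract (score suspect models on a key-set, then summarize how well the scores separate extracted from independent models with an AUC) can be illustrated with a minimal sketch. This is not the RAW Framework: the abstract mentions a learned verification model, whereas the sketch below substitutes a plain label-agreement rate as the score. The function names, the key-set labels, and all score values are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def agreement_rate(protected: Sequence[int], suspect: Sequence[int]) -> float:
    """Fraction of key-set samples on which the suspect model predicts
    the same label as the protected model (a stand-in score, assumed here)."""
    if len(protected) != len(suspect) or not protected:
        raise ValueError("label sequences must be non-empty and equal in length")
    matches = sum(1 for p, s in zip(protected, suspect) if p == s)
    return matches / len(protected)


def auc(positive_scores: Sequence[float], negative_scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive (extracted) model
    scores higher than a random negative (independent) model; ties count half."""
    if not positive_scores or not negative_scores:
        raise ValueError("both score lists must be non-empty")
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


# Hypothetical predictions on an 8-sample key-set (made-up labels).
protected_labels = [3, 1, 4, 1, 5, 9, 2, 6]
extracted_labels = [3, 1, 4, 1, 5, 9, 2, 7]    # mostly inherits the behavior
independent_labels = [3, 0, 4, 2, 5, 8, 1, 6]  # different seed, diverges more

extracted_score = agreement_rate(protected_labels, extracted_labels)
independent_score = agreement_rate(protected_labels, independent_labels)

# Made-up scores for two extracted and two independent suspect models.
example_auc = auc([extracted_score, 0.75], [independent_score, 0.75])
```

An AUC near 1.0 means the score almost always ranks extracted models above independent ones, while 0.5 means it does no better than chance.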

Comments: 9 pages, 6 figures, 3 tables

Categories: cs.CV

Keywords: computer vision models, seeds don't lie, adaptive watermarking framework, extracted model, training produces distinct characteristics
