arXiv:1102.3751 [cs.IT]

Utility-Privacy Tradeoff in Databases: An Information-theoretic Approach

Lalitha Sankar, S. Raj Rajagopalan, H. Vincent Poor

Published 2011-02-18, updated 2013-01-21 (version 4)

Ensuring the usefulness of electronic data sources while providing necessary privacy guarantees is an important unsolved problem. This problem drives the need for an analytical framework that can quantify the safety of personally identifiable information (privacy) while still providing a quantifiable benefit (utility) to multiple legitimate information consumers. This paper presents an information-theoretic framework that promises an analytical model guaranteeing tight bounds on how much utility is possible for a given level of privacy and vice versa. Specific contributions include: i) stochastic data models for both categorical and numerical data; ii) utility-privacy tradeoff regions for both classes of data, the encoding (sanitization) schemes that achieve them, and their practical relevance; and iii) modeling of prior knowledge at the user and/or data source, with optimal encoding schemes for both cases.

Comments: Revised following submission to the IEEE Transactions on Information Forensics and Security: Special Issue on Privacy and Trust Management in Cloud and Distributed Systems; updated with missing references
Categories: cs.IT, math.IT
Related articles:
arXiv:1010.0226 [cs.IT] (Published 2010-10-01)
An Information-theoretic Approach to Privacy
arXiv:2107.14264 [cs.IT] (Published 2021-07-29)
An Information-Theoretic Approach to Joint Sensing and Communication
arXiv:2107.01799 [cs.IT] (Published 2021-07-05)
An Information-Theoretic Approach for Automatically Determining the Number of States when Aggregating Markov Chains