Abstract
Analyzing the structure of social networks is of interest in a wide range of disciplines, but such activity is limited by the fact that these data represent sensitive information and can not be published in their raw form. One of the approaches to sanitize network data is to randomly add or remove edges from the graph. Recent studies have quantified the level of anonymity that is obtained by random perturbation by means of a-posteriori belief probabilities and, by conducting experiments on small datasets, arrived at the conclusion that random perturbation can not achieve meaningful levels of anonymity without deteriorating the graph features. We offer a new information-theoretic perspective on this issue. We make an essential distinction between image and preimage anonymity and propose a more accurate quantification, based on entropy, of the anonymity level that is provided by the perturbed network. We explain why the entropy-based quantification, which is global, is more adequate than the previously used local quantification based on a-posteriori belief. We also prove that the anonymity level quantified by means of entropy is always greater than or equal to the one based on a-posteriori belief probabilities. In addition, we introduce and explore the method of random sparsification, which randomly removes edges, without adding new ones. Extensive experimentation on several very large datasets shows that randomization techniques for identity obfuscation are back in the game, as they may achieve meaningful levels of anonymity while still preserving features of the original graph.
Original language | English |
---|---|
Title of host publication | 2011 IEEE 27th International Conference on Data Engineering |
Pages | 924-935 |
Number of pages | 12 |
DOIs | |
Publication status | Published - 2011 |
MoE publication type | A4 Article in a conference publication |
Event | International Conference on Data Engineering - Hannover, Germany Duration: 11 Apr 2011 → 16 Apr 2011 Conference number: 27 http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000178 |
Publication series
Name | International Conference on Data Engineering |
---|---|
Publisher | IEEE |
ISSN (Print) | 2375-0286 |
ISSN (Electronic) | 1084-4627 |
Conference
Conference | International Conference on Data Engineering |
---|---|
Abbreviated title | ICDE |
Country/Territory | Germany |
City | Hannover |
Period | 11/04/2011 → 16/04/2011 |
Internet address |