Fixed-parameter tractability of anonymizing data by suppressing entries

Evans, Patricia A.; Wareham, H. Todd; Chaytor, Rhonda

doi:10.1007/s10878-009-9253-6

Fixed-parameter tractability of anonymizing data by suppressing entries

Published: 18 August 2009

Volume 18, pages 362–375, (2009)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Journal of Combinatorial Optimization Aims and scope Submit manuscript

Fixed-parameter tractability of anonymizing data by suppressing entries

Download PDF

Patricia A. Evans¹,
H. Todd Wareham² &
Rhonda Chaytor³

97 Accesses
13 Citations
Explore all metrics

Abstract

A popular model for protecting privacy when person-specific data is released is k -anonymity. A dataset is k-anonymous if each record is identical to at least (k−1) other records in the dataset. The basic k-anonymization problem, which minimizes the number of dataset entries that must be suppressed to achieve k-anonymity, is NP-hard and hence not solvable both quickly and optimally in general. We apply parameterized complexity analysis to explore algorithmic options for restricted versions of this problem that occur in practice. We present the first fixed-parameter algorithms for this problem and identify key techniques that can be applied to this and other k-anonymization problems.

References

Aggarwal G, Feder T, Kenthapadi K, Motwani R, Panigrahy R, Thomas D, Zhu A (2005) Approximation algorithms for k-anonymity. J Priv Technol, paper 20051120001
Brankovic L, Estivill-Castro V (1999) Privacy issues in knowledge discovery and data mining. In: Proceedings of Australian institute of computer ethics conference (AICEC99), pp 89–99
Brankovic L, Miller M, Horak P, Wrightson G (1997) Usability of compromise-free statistical databases. In: Proceedings of the ninth international conference on scientific and statistical database management (SSDBM 1997). IEEE Press, New York, pp 144–154
Chapter Google Scholar
Bonizzoni P, Della Vedova G, Dondi R (2007) Anonymizing binary tables is APX-hard. The Computing Research Repository (CoRR) 0707.0421. http://arxiv.org/abs/0707.0421
Chaytor R (2006) Utility preserving k-anonymity. Technical report MUN-CS 2006-01, Dept Computer Science, Memorial University of Newfoundland
Chaytor R (2007) Allowing privacy protection algorithms to jump out of local optimums: an ordered greed framework. In: Bonchi F et al. (eds) Proceedings of the 1st SIGKDD international workshop on privacy, security, and trust in KDD (PinKDD’07). LNCS, vol 4890. Springer, Berlin, pp 33–55
Chapter Google Scholar
Downey R, Fellows M (1999) Parameterized complexity. Springer, Berlin
Google Scholar
Er MC (1988) A fast algorithm for generating set partitions. Comput J 31:283–284
Article MATH Google Scholar
Fernau H (2004) Complexity of a {0,1}-matrix problem. Australasian J Comb 29:273–300
MATH MathSciNet Google Scholar
Horak P, Brankovic L, Miller M (1999) A combinatorial problem in database security. Discrete Appl Math 91:119–126
Article MATH MathSciNet Google Scholar
Islam MZ, Brankovic L (2004) A framework for privacy preserving classification in data mining. In: Proceedings of the second workshop on Australasian information security, data mining and web intelligence, and software internationalisation (ACSW Frontiers 2004), pp 163–168
MacDonald (2005) personal communication
Meyerson A, Williams R (2004) On the complexity of optimal k-anonymity. In: Proceedings of 23rd ACM symposium on principles of database systems (PODS’04), pp 223–228
Niedermeier R (2006) Invitation to fixed-parameter algorithms. Oxford University Press, Oxford
Book MATH Google Scholar
Samarati P, Sweeney L (1998) Protecting privacy when disclosing information: k-anonymity and its enforcement through generalization and suppression. Technical report SRI-CSL-98-04, SRI International, Computer Science Laboratory
Sweeney L (2002) Achieving k-anonymity privacy protection using generalization and suppression. Int J Uncertain Fuzziness Knowl-Based Syst 10(5):571–588
Article MATH MathSciNet Google Scholar
Wang K, Yu P, Chakraborty S (2004) Bottom-up generalization: a data mining solution to privacy protection. In: Proceedings of 4th IEEE international conference on data mining (ICDM’04), pp 249–256
Wareham T (1999) Systematic parameterized complexity analysis in computational phonology. PhD thesis, Dept Computer Science, University of Victoria

Download references

Author information

Authors and Affiliations

Faculty of Computer Science, University of New Brunswick, Fredericton, NB, Canada
Patricia A. Evans
Department of Computer Science, Memorial University, St. John’s, NL, Canada
H. Todd Wareham
School of Computing Science, Simon Fraser University, Vancouver, BC, Canada
Rhonda Chaytor

Authors

Patricia A. Evans
View author publications
You can also search for this author in PubMed Google Scholar
H. Todd Wareham
View author publications
You can also search for this author in PubMed Google Scholar
Rhonda Chaytor
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Patricia A. Evans.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Evans, P.A., Wareham, H.T. & Chaytor, R. Fixed-parameter tractability of anonymizing data by suppressing entries. J Comb Optim 18, 362–375 (2009). https://doi.org/10.1007/s10878-009-9253-6

Download citation

Published: 18 August 2009
Issue Date: November 2009
DOI: https://doi.org/10.1007/s10878-009-9253-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Fixed-parameter tractability of anonymizing data by suppressing entries

Abstract

Article PDF

Similar content being viewed by others

Optimization algorithm for k-anonymization of datasets with low information loss

The Complexity of Finding a Large Subgraph under Anonymity Constraints

De-anonymization of Heterogeneous Random Graphs in Quasilinear Time

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Fixed-parameter tractability of anonymizing data by suppressing entries

Abstract

Article PDF

Similar content being viewed by others

Optimization algorithm for k-anonymization of datasets with low information loss

The Complexity of Finding a Large Subgraph under Anonymity Constraints

De-anonymization of Heterogeneous Random Graphs in Quasilinear Time

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation