K−Means Clustering Microaggregation for Statistical Disclosure Control

Kabir, Md. Enamul; Mahmood, Abdun Naser; Mustafa, Abdul K.

doi:10.1007/978-81-322-0740-5_135

Md. Enamul Kabir⁵,
Abdun Naser Mahmood⁵ &
Abdul K. Mustafa⁵

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 174))

1921 Accesses

Abstract

This paper presents a K-means clustering technique that satisfies the bi-objective function to minimize the information loss and maintain k-anonymity. The proposed technique starts with one cluster and subsequently partitions the dataset into two or more clusters such that the total information loss across all clusters is the least, while satisfying the k-anonymity requirement. The structure of K− means clustering problem is defined and investigated and an algorithm of the proposed problem is developed. The performance of the K− means clustering algorithm is compared against the most recent microaggregation methods. Experimental results show that K− means clustering algorithm incurs less information loss than the latest microaggregation methods for all of the test situations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Domingo-Ferrer, J., Mateo-Sanz, J.: Practical data-oriented microaggregation for statistical disclosure control. IEEE Transactions on Knowledge and Data Engineering 14(1), 189–201 (2002)
Article Google Scholar
Domingo-Ferrer, J., Torra, V.: Ordinal, continuous and heterogeneous kanonymity through microaggregation. Data Mining and Knowledge Discovery 11(2), 195–212 (2005)
Article MathSciNet Google Scholar
Domingo-Ferrer, J., Martinez-Balleste, A., Mateo-Sanz, J.M., Sebe, F.: Efficient multivariate data-oriented microaggregation. The VLDB Journal 15(4), 355–369 (2006)
Article Google Scholar
Domingo-Ferrer, J., Sebe, F., Solanas, A.: A polynomial-time approximation to optimal mul tivariate microaggregation. Computer and Mathematics with Applications 55(4), 714–732 (2008)
Article MathSciNet MATH Google Scholar
Samarati, P.: Protecting respondent’s privacy in microdata release. IEEE Transactions on Knowledge and Data Engineering 13(6), 1010–1027 (2001)
Article Google Scholar
Sweeney, L.: k-Anonymity: A model for protecting privacy. International Journal on Uncertainty, Fuzziness and Knowledge-based Systems 10(5), 557–570 (2002)
Article MathSciNet MATH Google Scholar
Kabir, M.E., Wang, H.: Systematic Clustering-based Microaggregation for Statistical Disclosure Control. In: Proc. IEEE International Conference on Network and System Security, Melbourne, pp. 435–441 (September 2010)
Google Scholar
Kabir, M.E., Wang, H., Bertino, E., Chi, Y.: Systematic Clustering Method for l-diversity Model. In: Proc. Australasian Database Conference, Brisbane, pp. 93–102 (January 2010)
Google Scholar
Kabir, M.E., Wang, H.: Microdata Protection Method Through Microaggragation: A Median Based Approach. Information Security Journal: A Global Perspective (in press)
Google Scholar
Chang, C.-C., Li, Y.-C., Huang, W.-H.: TFRP: An efficient microaggregation algorithm for statistical disclosure control. Journal of Systems and Software 80(11), 1866–1878 (2007)
Article Google Scholar
Lin, J.-L., Wen, T.-H., Hsieh, J.-C., Chang, P.-C.: Density-based microaggregation for statistical disclosure control. Expert Systems with Applications 37(4), 3256–3263 (2010)
Article Google Scholar
Lloyd, S.: Least squares quantization in PCM. IEEE Transactions on Information Theory 28(2), 129–137 (1982)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

University of New South Wales, Kensington, Australia
Md. Enamul Kabir, Abdun Naser Mahmood & Abdul K. Mustafa

Authors

Md. Enamul Kabir
View author publications
You can also search for this author in PubMed Google Scholar
Abdun Naser Mahmood
View author publications
You can also search for this author in PubMed Google Scholar
Abdul K. Mustafa
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

M. S. Ramaiah Institute of Technology, Bengaluru, India
Aswatha Kumar M.
M. S. Ramaiah Institute of Technology, Bengaluru, India
Selvarani R.
M. S. Ramaiah Institute of Technology, Bengaluru, India
T V Suresh Kumar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kabir, M.E., Mahmood, A.N., Mustafa, A.K. (2013). K−Means Clustering Microaggregation for Statistical Disclosure Control. In: Kumar M., A., R., S., Kumar, T. (eds) Proceedings of International Conference on Advances in Computing. Advances in Intelligent Systems and Computing, vol 174. Springer, New Delhi. https://doi.org/10.1007/978-81-322-0740-5_135

Download citation

DOI: https://doi.org/10.1007/978-81-322-0740-5_135
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-0739-9
Online ISBN: 978-81-322-0740-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics