Skip to main content

P-IRON for Privacy Preservation in Data Mining

  • Conference paper
  • First Online:
Knowledge Management in Organizations (KMO 2017)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 731))

Included in the following conference series:

  • 1874 Accesses

Abstract

Data mining includes extracting useful and interesting patterns from large dataset, to create and enhance decision support systems. Due to this, data mining has become an important component in various fields of day-to-day life including medicine, business, education, science and so on. Numerous data mining techniques have been developed. These techniques make the privacy preservation an important issue. When applying privacy preservation techniques, importance is given to the utility and information loss. In this paper we propose Preference Imposed Individual Ranking based microaggregation with Optimal Noise addition technique (P-IRON) for anonymizing the individual records. Through the experimental results, our proposed technique is validated to prevent the disclosure of sensitive data without degradation of data utilization. Our work highlights some discussions about future work and promising directions in the perspective of privacy preservation in data mining.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dehkordi, M.N., Badie, K., Zadeh, A.K.: A novel method for privacy preserving in association rule mining based on genetic algorithms. JSW 4(6), 555–562 (2009)

    Article  Google Scholar 

  2. Lindell, Y., Pinkas, B.: Privacy preserving data mining. In: Bellare, M. (ed.) CRYPTO 2000. LNCS, vol. 1880, pp. 36–54. Springer, Heidelberg (2000). doi:10.1007/3-540-44598-6_3

    Chapter  Google Scholar 

  3. Agrawal, R., Srikant, R.: Privacy-preserving data mining. ACM SIGMOD Rec. 29(2), 439–450 (2000). ACM

    Article  Google Scholar 

  4. Muralidhar, K., Parsa, R., Sarathy, R.: A general additive data perturbation method for database security. Manage. Sci. 45(10), 1399–1415 (1999)

    Article  Google Scholar 

  5. Aggarwal, C.C., Yu, P.S.: A condensation approach to privacy preserving data mining. In: Bertino, E., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K., Ferrari, E. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 183–199. Springer, Heidelberg (2004). doi:10.1007/978-3-540-24741-8_12

    Chapter  Google Scholar 

  6. Chen, K., Liu, L.: A random rotation perturbation approach to privacy preserving data classification (2005)

    Google Scholar 

  7. Liu, K., Kargupta, H., Ryan, J.: Random projection-based multiplicative data perturbation for privacy preserving distributed data mining. IEEE Trans. Knowl. Data Eng. 18(1), 92–106 (2006)

    Article  Google Scholar 

  8. Chen, K., Sun, G., Liu, L.: Towards attack-resilient geometric data perturbation. In: Proceedings of the 2007 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics (2007)

    Google Scholar 

  9. Oliveira, S.R.M., Zaiane, O.R.: Privacy preserving clustering by data transformation. J. Inf. Data Manag. 1(1), 37 (2010)

    Google Scholar 

  10. Gal, T.S., Chen, Z., Gangopadhyay, A.: A privacy protection model for patient data with multiple sensitive attributes. IGI Glob. 2, 28 (2008)

    Google Scholar 

  11. Aggarwal, C.C., Yu, P.S.: Privacy-preserving data mining: a survey. In: Gertz, M., Jajodia, S. (eds.) Handbook of Database Security, 431–460. Springer, New York (2008)

    Google Scholar 

  12. Domingo-Ferrer, J., Mateo-Sanz, J.M.: Practical data-oriented microaggregation for statistical disclosure control. IEEE Trans. Knowl. Data Eng. 14(1), 189–201 (2002)

    Article  Google Scholar 

  13. Domingo-Ferrer, J., et al.: Efficient multivariate data-oriented microaggregation. The VLDB J. Int. J. Very Large Data Bases 15(4), 355–369 (2006)

    Article  Google Scholar 

  14. Oganian, A., Domingo-Ferrer, J.: On the complexity of optimal microaggregation for statistical disclosure control. Stat. J. U. N. Econ. Comm. Eur. 18(4), 345–353 (2001)

    Google Scholar 

  15. Solanas, A., Martinez-Balleste, A., Domingo-Ferrer, J.: V-MDAV: a multivariate microaggregation with variable group size. In: Proceedings of the 17th COMPSTAT Symposium of the IASC, Rome (2006)

    Google Scholar 

  16. Lin, J.-L., et al.: Density-based microaggregation for statistical disclosure control. Expert Syst. Appl. 37(4), 3256–3263 (2010)

    Article  Google Scholar 

  17. Kabir, M.E., Wang, H.: Microdata protection method through microaggregation: a median-based approach. Inf. Secur. J. Glob. Perspect. 20(1), 1–8 (2011)

    Article  Google Scholar 

  18. Soria-Comas, J., et al.: T-Closeness through microaggregation: strict privacy with enhanced utility preservation. IEEE Trans. Knowl. Data Eng. 27(11), 3098–3110 (2015)

    Article  Google Scholar 

  19. Sánchez, D., et al.: Utility-preserving differentially private data releases via individual ranking microaggregation. Inf. Fusion 30, 1–14 (2016)

    Article  Google Scholar 

  20. Gal, T.S., et al.: A data recipient centered de-identification method to retain statistical attributes. J. Biomed. Inform. 50, 32–45 (2014)

    Article  Google Scholar 

  21. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco (2000)

    Google Scholar 

  22. Arumugam, G., Sulekha, V.: IMR based anonymization for privacy preservation in data mining. In: Proceedings of the 11th International Knowledge Management in Organizations Conference on the Changing Face of Knowledge Management Impacting Society. ACM (2016)

    Google Scholar 

  23. Wishart, D.: 256. Note: an algorithm for hierarchical classifications. Biometrics 25, 165–170 (1969)

    Article  Google Scholar 

  24. Soria-Comas, J., Domingo-Ferrer, J.: Optimal data-independent noise for differential privacy. Inf. Sci. 250, 200–214 (2013)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to V. Jane Varamani Sulekha .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Arumugam, G., Jane Varamani Sulekha, V. (2017). P-IRON for Privacy Preservation in Data Mining. In: Uden, L., Lu, W., Ting, IH. (eds) Knowledge Management in Organizations. KMO 2017. Communications in Computer and Information Science, vol 731. Springer, Cham. https://doi.org/10.1007/978-3-319-62698-7_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-62698-7_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-62697-0

  • Online ISBN: 978-3-319-62698-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics