Abstract
Privacy-preservation in distributed progressive databases is an active area of research in recent years. In a typical scenario, multiple parties may wish to collaborate to extract interesting global information such as class labels without revealing their respective data to each other. This may be particularly useful in applications such as customer retention, medical research etc. In the proposed work, we aim to develop a global classification model based on the Naïve Bayes classification scheme. The Naïve Bayes classification has been used because of its simplicity, high efficiency. For privacy-preservation of the data, the concept of trusted third party with two offsets has been used. The data is first anonymized at local party end and then the aggregation and global classification is done at the trusted third party. The proposed algorithms address various types of fragmentation schemes such as horizontal, vertical and arbitrary distribution required format. The car-evaluation dataset is used to test the effectiveness of proposed algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vaidya, J., Kantarcıoğlu, M., Clifton, C.: Privacy-Preserving Naïve Bayes classification. International Journal on Very Large Data Bases 17(4), 879–898 (2008)
Huang, J.-W., Tseng, C.-Y.: A General Model for Sequential Pattern Mining with a Progressive Databases. IEEE Trans. Knowledge Engineering 20(9), 1153–1167 (2008)
Liew, C.K., Choi, U.J., Liew, C.J.: A data distortion by probability distribution. ACM TODS, 395–411 (1985)
Warner, S.L.: Randomized Response: A survey technique for eliminating evasive answer bias. Journal of American Statistical Association (60), 63–69 (1965)
Agarwal, R., Srikanth, R.: Privacy–preserving data mining. In: Proceedings of the ACM SIGMOD conference, vol. 29, pp. 439–450 (2005)
Agarwal, D., Agarwal, C.C.: On the design and Quantification of Privacy–Preserving Data Mining Algorithms. In: ACM PODS Conference, pp. 1224–1236 (2002)
Zhang, P., Tong, Y., Tang, D.: Privacy–Preserving Naïve Bayes Classifier. In: Li, X., Wang, S., Dong, Z.Y. (eds.) ADMA 2005. LNCS (LNAI), vol. 3584, pp. 744–752. Springer, Heidelberg (2005)
Zhu, Y., Liu, L.: Optimal Randomization for Privacy–Preserving Data Mining. In: KDD ACM KDD Conference (2004)
Gambs, S., Kegl, B., Aimeur, E.: Privacy–Preserving Boosting. Journal (to appear)
Poovammal, E., Poonavaikko, M.: An Improved Method for Privacy Preserving Data Mining. In: IEEE IACC Conference, Patiala, India, pp. 1453–1458 (2009)
Yao, A.C.: Protocol for secure sum computations. In: Proc. IEEE Foundations of Computer Science, pp. 160–164 (1982)
Bohanec, M., Zupan, B.: UCI Machine Learning Repository. (1997), http://archive.ics.uci.edu/ml/datasets/Car+Evaluation
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
B.N., K., Toshniwal, D. (2011). Privacy Preserving Naïve Bayes Classification Using Trusted Third Party Computation over Distributed Progressive Databases. In: Meghanathan, N., Kaushik, B.K., Nagamalai, D. (eds) Advances in Computer Science and Information Technology. CCSIT 2011. Communications in Computer and Information Science, vol 131. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17857-3_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-17857-3_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17856-6
Online ISBN: 978-3-642-17857-3
eBook Packages: Computer ScienceComputer Science (R0)