Privacy-Preserving Naive Bayesian Classification over Horizontally Partitioned Data

Zhan, Justin; Matwin, Stan; Chang, Li Wu

doi:10.1007/978-3-540-78488-3_31

Privacy-Preserving Naive Bayesian Classification over Horizontally Partitioned Data

Justin Zhan⁶,
Stan Matwin⁷ &
Li Wu Chang⁸

Chapter

1211 Accesses
5 Citations

Part of the book series: Studies in Computational Intelligence ((SCI,volume 118))

Recent advances¹ in computer networking and database technologies have resulted in creation of large quantities of data which are located in different sites. Data mining is a useful tool to extract valuable knowledge from this data. Well known data mining algorithms include association rule mining, classification, clustering, outlier detection, etc. However, extracting useful knowledge from distributed sites is often challenging due to real world constraints such as privacy, communication and computation overhead.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

G. Aggarwal, N. Mishra, and B. Pinkas. Secure computation of the k th-ranked element. In EUROCRYPT pp 40–55, 2004.
Google Scholar
R. Agrawal and R. Srikant. Privacy-preserving data mining. In Proceedings of the ACM SIGMOD Conference on Management of Data, pp 439–450. ACM, May 2000.
Google Scholar
J. Benaloh. Dense probabilistic encryption. In Proceedings of the Workshop on Selected Areas of Cryptography, pp 120–128, Kingston, Ontario, May 1994.
Google Scholar
J. Domingo-Ferrer. A provably secure additive and multiplicative privacy homomorphism. In Information Security Conference, pp 471–483, 2002.
Google Scholar
W. Du and Z. Zhan. Using randomized response techniques for privacy-preserving data mining. In Proceedings of The 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24–27 2003.
Google Scholar
C. Dwork and K. Nissim. Privacy-preserving datamining on vertically partitioned databases. In CRYPTO 2004, pp 528–544.
Google Scholar
A. Evfmievski, J. Gehrke, and R. Srikant. Limiting privacy breaches in privacy preserving data mining. In Proceedings of the 22nd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pp 211–222, San Diego, CA, June 9–12, 2003.
Google Scholar
M. Franklin, Z. Galil, and M. Yung. An overview of secure distributed computing. Technical Report TR CUCS-00892, Department of Computer Science, Columbia University, 1992.
Google Scholar
B. Goethals, S. Laur, H. Lipmaa, and T. Mielikainen. On secure scalar product computation for privacy-preserving data mining. In Proceedings of The 7th Annual International Conference in Information Security and Cryptology (ICISC 2004), Volume 3506 of Lecture Notes in Computer Science, pp 104–120, Seoul, Korea, December 2–3, 2004, Springer, Berlin Heidelberg New York, 2004.
Google Scholar
O. Goldreich. Secure multi-party computation (working draft). http://www.wisdom.weizmann.ac.il/home/oded/public_html/foc.html, 1998.
O. Goldreich, S. Micali, and A. Wigderson. How to play any mental game. In Proceedings of the 19th Annual ACM Symposium on Theory of Computing, pp 218–229, 1987.
Google Scholar
S. Goldwasser. Multi-party computations: Past and present. In Proceedings of the 16th Annual ACM Symposium on Principles of Distributed Computing, Santa Barbara, CA USA, August 21–24, 1997.
Google Scholar
Y. Lindell and B. Pinkas. Privacy preserving data mining. In Advances in Cryptology – Crypto2000, Lecture Notes in Computer Science, Volume 1880, 2000.
Google Scholar
D. Naccache and J. Stern. A new public key cryptosystem based on higher residues. In Proceedings of the 5th ACM conference on Computer and Communication Security, pp 59–66, San Francisco, California, United States, 1998.
Google Scholar
T. Okamoto and S. Uchiyama. A new public-key cryptosystem as secure as factoring. In Eurocrypt’98, LNCS 1403, pp 308–318, 1998.
Google Scholar
P. Paillier. Public key cryptosystems based on composite degree residuosity classes. In In Advances in Cryptology – Eurocrypt’99 Proceedings, LNCS 1592, pp 223–238, Springer, Berlin Heidelberg New York, 1999.
Google Scholar
R. Rivest, L. Adleman, and M. Dertouzos. On data banks and privacy homomorphisms. In Foundations of Secure Computation, eds. R. A. DeMillo et al., Academic Press, pp 169–179, 1978.
Google Scholar
S. Rizvi and J.R. Haritsa. Maintaining data privacy in association rule mining. In Proceedings of the 28th VLDB Conference, Hong Kong, China, 2002.
Google Scholar
L. Sweeney. k-anonymity: a model for protecting privacy. In International Journal on Uncertainty, Fuzziness and Knowledge-based Systems 10(5), 557–570, 2002.
Article MATH MathSciNet Google Scholar
J. Vaidya and C. Clifton. Privacy preserving association rule mining in vertically partitioned data. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp 639–644, Edmonton, Alberta, Canada, July 23–26, 2002.
Google Scholar
A. C. Yao. Protocols for secure computations. In Proceedings of the 23rd Annual IEEE Symposium on Foundations of Computer Science, 1982.
Google Scholar
J. Zhan and S. Matwin. Privacy-preserving nave bayesian classification over vertically partitioned data. In IEEE ICDM Workshop on Foundations of Semantic Oriented Data and Web Mining, Houston, Texas, USA, November 27–30, 2005.
Google Scholar

Download references

Author information

Authors and Affiliations

Carnegie Mellon University, USA
Justin Zhan
University of Ottawa, Canada
Stan Matwin
Naval Research Laboratory, USA
Li Wu Chang

Authors

Justin Zhan
View author publications
You can also search for this author in PubMed Google Scholar
Stan Matwin
View author publications
You can also search for this author in PubMed Google Scholar
Li Wu Chang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, San Jose State University, San Jose, CA, 95192, USA
Tsau Young Lin
Department of Computer Science and Information Systems, Kennesaw State University, Building 11, Room 3060 1000 Chastain Road, Kennesaw, GA, 30144, USA
Ying Xie
Department of Computer Science, The University at Stony Brook, Stony Brook, New York, 11794-4400, USA
Anita Wasilewska
Institute of Information Science, Academia Sinica, No 128, Academia Road, Section 2 Nankang, Taipei, 11529, Taiwan
Churn-Jung Liau

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zhan, J., Matwin, S., Chang, L.W. (2008). Privacy-Preserving Naive Bayesian Classification over Horizontally Partitioned Data. In: Lin, T.Y., Xie, Y., Wasilewska, A., Liau, CJ. (eds) Data Mining: Foundations and Practice. Studies in Computational Intelligence, vol 118. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78488-3_31

Download citation

DOI: https://doi.org/10.1007/978-3-540-78488-3_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78487-6
Online ISBN: 978-3-540-78488-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Buying options