Partial Domain Theories for Privacy

Armengol, Eva; Torra, Vicenç

doi:10.1007/978-3-319-45656-0_18

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9880)

Included in the following conference series:

International Conference on Modeling Decisions for Artificial Intelligence

Abstract

Generalization and suppression are two of the most widely used techniques for achieving k-anonymity. The concept of generalization is also used in machine learning to obtain domain models useful for classification, and suppression is the means of achieving such generalization. In this paper we address the anonymization of data while preserving its usefulness for classification. We propose using machine learning methods to obtain partial domain theories formed by partial descriptions of classes. Unlike in machine learning, we require these descriptions to be as specific as possible, i.e., formed by the maximum number of attributes. This is achieved by suppressing some values of some records. Our method suppresses only a particular value of an attribute in only a subset of records; that is, it uses local suppression. This avoids one of the problems of global suppression, namely the loss of more information than necessary.
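
The difference between global and local suppression can be illustrated with a toy example. The sketch below is not the method of the paper (which derives the suppressions from partial domain theories); it only shows, on invented records, that suppressing an attribute in a chosen subset of records can reach k-anonymity while hiding fewer values than suppressing it everywhere. The record contents, the `*` marker, and all function names are assumptions made for illustration, and selecting records by index is a simplification.

```python
from collections import Counter

SUPPRESSED = "*"  # hypothetical marker for a suppressed value


def global_suppression(records, attribute):
    """Suppress `attribute` in every record."""
    return [{**r, attribute: SUPPRESSED} for r in records]


def local_suppression(records, attribute, selected):
    """Suppress `attribute` only in the records whose index is in `selected`."""
    return [
        {**r, attribute: SUPPRESSED} if i in selected else dict(r)
        for i, r in enumerate(records)
    ]


def is_k_anonymous(records, quasi_identifiers, k):
    """True if every combination of quasi-identifier values occurs at least k times."""
    groups = Counter(tuple(r[a] for a in quasi_identifiers) for r in records)
    return all(count >= k for count in groups.values())


def count_suppressed(records):
    """Number of suppressed cells, a crude measure of information loss."""
    return sum(v == SUPPRESSED for r in records for v in r.values())


# Invented toy data: two quasi-identifiers and a class label.
records = [
    {"age": "30-39", "city": "A", "label": "pos"},
    {"age": "30-39", "city": "B", "label": "pos"},
    {"age": "40-49", "city": "C", "label": "neg"},
    {"age": "40-49", "city": "C", "label": "neg"},
]
qi = ["age", "city"]

glob = global_suppression(records, "city")
loc = local_suppression(records, "city", selected={0, 1})

print(is_k_anonymous(records, qi, 2))  # original data
print(is_k_anonymous(glob, qi, 2), count_suppressed(glob))
print(is_k_anonymous(loc, qi, 2), count_suppressed(loc))
```

In this toy case both variants make the data 2-anonymous, but the local variant leaves the `city` values of the last two records intact, since they already formed a group of size two.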

Acknowledgments

This research is partially funded by the project RPREF (CSIC Intramural 201650E044) and the grant 2014-SGR-118 from the Generalitat de Catalunya.

Author information

Authors and Affiliations

IIIA - Artificial Intelligence Research Institute, CSIC - Spanish Council for Scientific Research, Campus UAB, 08193, Bellaterra, Catalonia, Spain
Eva Armengol
School of Informatics, University of Skövde, Skövde, Sweden
Vicenç Torra

Corresponding author

Correspondence to Eva Armengol.

Editor information

Editors and Affiliations

University of Skövde, Skövde, Sweden
Vicenç Torra
Toho Gakuen, Kunitachi, Tokyo, Japan
Yasuo Narukawa
Enginyeria de la Informacio I de les Com, Univ Autonoma de Barcelona, Bellaterra, Spain
Guillermo Navarro-Arribas
Universitat d'Andorra, Sant Julià de Lòria, Andorra
Cristina Yañez

Cite this paper

Armengol, E., Torra, V. (2016). Partial Domain Theories for Privacy. In: Torra, V., Narukawa, Y., Navarro-Arribas, G., Yañez, C. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2016. Lecture Notes in Computer Science, vol 9880. Springer, Cham. https://doi.org/10.1007/978-3-319-45656-0_18

DOI: https://doi.org/10.1007/978-3-319-45656-0_18
Published: 08 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45655-3
Online ISBN: 978-3-319-45656-0
eBook Packages: Computer Science, Computer Science (R0)
