Skip to main content

Document Sanitization in the Age of Data Mining

  • Chapter
Privacy and Technologies of Identity
  • 1294 Accesses

Abstract

The volume of data collected about people and their activities has increased over the years, especially with the widespread use of the internet. Data collection efforts coupled with powerful querying and data mining tools have raised concerns among people regarding their privacy. Recently the issue of privacy has been investigated in the context of databases and data mining to develop privacy preserving technologies. In this work, we concentrate on textual data and discuss methods for preserving privacy in text documents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer Science+Business Media, Inc.

About this chapter

Cite this chapter

Hakkani-Tür, D., Tur, G., Saygin, Y., Tang, M. (2006). Document Sanitization in the Age of Data Mining. In: Strandburg, K.J., Raicu, D.S. (eds) Privacy and Technologies of Identity. Springer, Boston, MA. https://doi.org/10.1007/0-387-28222-X_15

Download citation

  • DOI: https://doi.org/10.1007/0-387-28222-X_15

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-26050-1

  • Online ISBN: 978-0-387-28222-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics