Data Mining Technologies for Digital Libraries and Web Information Systems

  • Conference paper
  • In: Digital Libraries: People, Knowledge, and Technology (ICADL 2002)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2555)

Abstract

In the first half of the talk, I will discuss data mining technologies that can result in better browsing and searching. Consider the problem of merging documents from different categorizations (taxonomies) into a single master categorization. Current classifiers ignore the implicit similarity information present in the source categorizations. I will show that by incorporating this information into the classification model, classification accuracy can be substantially improved [1]. Next, I will demonstrate novel search technology that treats numbers as first-class objects, and thus yields dramatically better results than current Web search engines when searching over product descriptions or other number-rich documents [2].
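The idea of treating numbers as first-class objects can be illustrated with a minimal sketch. This is not the algorithm from [2]; it is a simplified stand-in that assumes a document is relevant when it contains numbers close to those in the query, regardless of units or attribute names. The function names (`extract_numbers`, `relative_distance`, `score`, `rank`) and the sample documents are hypothetical and chosen only for illustration.

```python
import re

# Matches unsigned integers and decimals, e.g. "512" or "1.8".
NUMBER = re.compile(r"\d+(?:\.\d+)?")


def extract_numbers(text):
    """Return every number found in the text as a float."""
    return [float(m) for m in NUMBER.findall(text)]


def relative_distance(q, n):
    """Distance between two numbers, scaled to [0, 1] for non-negative inputs."""
    denom = max(abs(q), abs(n))
    return 0.0 if denom == 0 else abs(q - n) / denom


def score(query_numbers, doc_numbers):
    """Sum, over query numbers, of the distance to the closest document number.

    Lower is better. A document with no numbers gets an infinite score.
    (Assumption: each query number is matched independently, so two query
    numbers may match the same document number.)
    """
    if not doc_numbers:
        return float("inf")
    return sum(
        min(relative_distance(q, n) for n in doc_numbers)
        for q in query_numbers
    )


def rank(query, docs):
    """Order documents from best to worst numeric match for the query."""
    query_numbers = extract_numbers(query)
    return sorted(docs, key=lambda d: score(query_numbers, extract_numbers(d)))


# Hypothetical product descriptions.
docs = [
    "laptop 256 GB disk 8 GB memory 2.4 GHz",
    "laptop 512 GB disk 16 GB memory 1.8 GHz",
    "notebook paper 100 pages",
]
ranked = rank("500 16", docs)
print(ranked[0])
```

A keyword engine would find no document containing the token "500", whereas the numeric score above ranks the 512 GB / 16 GB description first because its numbers lie closest to the query.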


References

  1. Rakesh Agrawal and Ramakrishnan Srikant. On catalog integration. In Proc. of the Tenth Int’l World Wide Web Conference, Hong Kong, May 2001.

  2. Rakesh Agrawal and Ramakrishnan Srikant. Searching with numbers. In Proc. of the Eleventh Int’l World Wide Web Conference, Honolulu, Hawaii, May 2002.

  3. Benny Chor, Oded Goldreich, Eyal Kushilevitz, and Madhu Sudan. Private information retrieval. In IEEE Symposium on Foundations of Computer Science, pp. 41–50, 1995.

  4. Rakesh Agrawal and Ramakrishnan Srikant. Privacy preserving data mining. In Proc. of the ACM SIGMOD Conference on Management of Data, pp. 439–450, Dallas, Texas, May 2000.

  5. Alexandre Evfimievski, Ramakrishnan Srikant, Rakesh Agrawal, and Johannes Gehrke. Privacy preserving mining of association rules. In Proc. of the 8th ACM SIGKDD Int’l Conference on Knowledge Discovery and Data Mining, Edmonton, Canada, July 2002.

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Srikant, R. (2002). Data Mining Technologies for Digital Libraries and Web Information Systems. In: Lim, E.P., et al. Digital Libraries: People, Knowledge, and Technology. ICADL 2002. Lecture Notes in Computer Science, vol 2555. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36227-4_7

  • DOI: https://doi.org/10.1007/3-540-36227-4_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00261-1

  • Online ISBN: 978-3-540-36227-2

  • eBook Packages: Springer Book Archive
