Skip to main content

A Rough Set-Aided System for Sorting WWW Bookmarks

  • Conference paper
  • First Online:
Book cover Web Intelligence: Research and Development (WI 2001)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2198))

Included in the following conference series:

Abstract

Most people store “bookmarks” to web pages. These allow the user to return to a web page later on, without having to remember the exact URL address. People attempt to organise their bookmark databases by filing bookmarks under categories, themselves arranged in a hierarchicalfashion. As the maintenance of such large repositories is difficult and time-consuming, a tool that automatically categorises bookmarks is required. This paper investigates how rough set theory can help extract information out of this domain, for use in an experimentalautomatic bookmark classification system. In particular, work on rough set dependency degrees is applied to reduce the otherwise high dimensionality of the feature patterns used to characterize bookmarks. A comparison is made between this approach to data reduction and a conventional entropy-based approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. L. Tauscher and S. Greenberg, Revisitation patterns in World Wide Web navigation, in: Proc. 1997 ACM CHI Conference, Atlanta, GA, March 1997.

    Google Scholar 

  2. Georgia Tech Research Corporation, GVU’s 8th WWW User Survey, 1997, information available at http://www.gvu.gatech.edu/user_surveys/survey-1997-10/

  3. K. Larson and M. Czerwinski, Web page design: implications of memory, structure and scent for information retrieval, in: Proc. 1998 ACM SIGCHI Conf. on Human Factors in Computing Systems, Los Angeles, CA, April 1998, pp. 25–32.

    Google Scholar 

  4. Y. S. Maarek, I. Z. Ben Shaul. Automatically Organizing Bookmarks per Contents. Fifth International W orld Wide Web Conference 1996, Paris, France. http://www5conf.inria.fr/fich html/papers/P37/Overview.html

  5. W. Li, Q. Vu, D. Agrawal, Y. Hara, H. Takano. PowerBookmarks: a system for personalizable Web information organization, sharing, and management. Proceedings of the Eighth InternationalW orld Wide Web Conference, Toronto, Canada, 11-14 May 1999, ISBN 0-444-50264-5.

    Google Scholar 

  6. P. Devijver and J. Kittler, (1982) Pattern Recognition: A Statistical Approach. Prentice Hall.

    Google Scholar 

  7. T. Mitchell (1997) Machine Learning. McGraw-Hill.

    Google Scholar 

  8. Z. Pawlak. Rough Sets: Theoretical Aspects of Reasoning About Data. Kluwer Academic Publishing, Dordrecht, 1991.

    MATH  Google Scholar 

  9. Q. Shen and A. Chouchoulas. A Modular Approach to Generating Fuzzy Rules with Reduced Attributes for the Monitoring of Complex Systems. Engineering Applications of Artificial Intelligence, 13(3):263–278, 2000.

    Article  Google Scholar 

  10. J.R. Quinlan. Induction of Decision Trees. Machine Learning 1(1), pp. 81–106. 1986.

    Google Scholar 

  11. M. Dash, H. Liu, J. Yao. Dimensionality Reduction of Unsupervised Data. Proceedings of the 9th International Conference on Tools with Artificial Intelligence (ICTAI’97).

    Google Scholar 

  12. A. Chouchoulas and Q. Shen. Rough set-aided keyword reduction for text categorisation. Applied Artificial Intelligence, 2001.

    Google Scholar 

  13. H. S. Heaps, Information retrieval, computationaland theoreticalasp ects. Academic Press, 1978.

    Google Scholar 

  14. G. Salton, Introduction to Modern Information Retrieval. McGraw-Hill, 1983.

    Google Scholar 

  15. G. Salton, E. A. Fox, and H. Wu, (Cornell Technical Report TR82-511) Extended Boolean Information Retrieval. Cornell University. August 1982.

    Google Scholar 

  16. G. Salton, and C. Buckley. Term Weighting Approaches in Automatic Text Retrieval. Technical Report TR87-881, Department of Computer Science, Cornell University, 1987. Information Processing and Management Vol.32 (4), p. 431–443, 1996.

    Google Scholar 

  17. C.J. van Rijsbergen. Information Retrieval. Butterworths, London, United Kingdom, 1979. http://www.dcs.gla.ac.uk/Keith/Preface.html.

    Google Scholar 

  18. W. Pedrycz, and F. Gomide. An Introduction to Fuzzy Sets: Analysis and Design. The MIT Press, 1998.

    Google Scholar 

  19. R. Jensen. Rough-Fuzzy Methods for Determining Fuzzy Reducts. Project Report. The University of Edinburgh, 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jensen, R., Shen, Q. (2001). A Rough Set-Aided System for Sorting WWW Bookmarks. In: Zhong, N., Yao, Y., Liu, J., Ohsuga, S. (eds) Web Intelligence: Research and Development. WI 2001. Lecture Notes in Computer Science(), vol 2198. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45490-X_10

Download citation

  • DOI: https://doi.org/10.1007/3-540-45490-X_10

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42730-8

  • Online ISBN: 978-3-540-45490-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics