Multi-level Document Classifications with Self-organising Maps

  • Conference paper
Intelligent Data Engineering and Automated Learning - IDEAL 2005 (IDEAL 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3578))

Abstract

The Self-Organising Map (SOM) is widely used to classify document collections. Such classifications are usually coarse-grained and cannot support accurate document retrieval. A document classification scheme based on the Multi-level Nested Self-Organising Map (MNSOM) is proposed to solve this problem. An MNSOM consists of a top map and a set of nested maps organised at different levels. The clusters on the top map are relatively general, which favours retrieval recall; the nested maps refine these clusters into more specific groups, which enhances retrieval precision. The MNSOM was tested on a collection of software documents. The experimental results show that the MNSOM significantly improved retrieval performance compared with classification based on a single SOM.
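
The nested structure described in the abstract can be illustrated with a minimal sketch. This is not the author's implementation: the abstract does not give map sizes, training parameters, or the document representation, so everything below is an assumption. The names `train_som`, `build_mnsom`, and `retrieve` are hypothetical, documents are assumed to be plain numeric term vectors, and only two levels (one top map, one layer of nested maps) are shown for brevity.

```python
import math
import random

def best_unit(weights, v):
    """Index of the map unit whose weight vector is closest to v."""
    return min(range(len(weights)),
               key=lambda i: sum((w - x) ** 2 for w, x in zip(weights[i], v)))

def train_som(vectors, rows, cols, epochs=30, seed=0):
    """Train a small rectangular SOM; returns one weight vector per unit."""
    rng = random.Random(seed)
    # Initialise each unit with a copy of a randomly chosen input vector.
    weights = [list(rng.choice(vectors)) for _ in range(rows * cols)]
    for epoch in range(epochs):
        frac = 1.0 - epoch / epochs
        rate = 0.5 * frac                            # learning rate decays linearly
        radius = max(rows, cols) / 2.0 * frac + 0.5  # neighbourhood shrinks
        for v in rng.sample(vectors, len(vectors)):  # shuffled pass over inputs
            br, bc = divmod(best_unit(weights, v), cols)
            for i, w in enumerate(weights):
                r, c = divmod(i, cols)
                d2 = (r - br) ** 2 + (c - bc) ** 2
                h = math.exp(-d2 / (2.0 * radius ** 2))  # Gaussian neighbourhood
                for k in range(len(w)):
                    w[k] += rate * h * (v[k] - w[k])
    return weights

def build_mnsom(vectors, top=(2, 2), nested=(2, 2), min_docs=3):
    """Assumed two-level scheme: a top map plus one nested map per large cluster."""
    top_w = train_som(vectors, *top)
    clusters = {}
    for doc_id, v in enumerate(vectors):
        clusters.setdefault(best_unit(top_w, v), []).append(doc_id)
    nested_maps = {}
    for unit, doc_ids in clusters.items():
        if len(doc_ids) >= min_docs:  # only refine clusters worth splitting
            nested_maps[unit] = train_som([vectors[d] for d in doc_ids], *nested)
    return top_w, clusters, nested_maps

def retrieve(query, vectors, model):
    """General cluster from the top map, narrowed by the nested map if present."""
    top_w, clusters, nested_maps = model
    unit = best_unit(top_w, query)
    doc_ids = clusters.get(unit, [])
    if unit not in nested_maps:
        return doc_ids
    sub_w = nested_maps[unit]
    q_sub = best_unit(sub_w, query)
    return [d for d in doc_ids if best_unit(sub_w, vectors[d]) == q_sub]
```

In this sketch, a query first lands in a broad top-map cluster (the recall-oriented step) and is then narrowed to the documents sharing its nested-map unit (the precision-oriented step). A deeper hierarchy could be obtained by applying `build_mnsom` recursively to each nested cluster, though whether the paper does so in this way is not stated in the abstract.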

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ye, H. (2005). Multi-level Document Classifications with Self-organising Maps. In: Gallagher, M., Hogan, J.P., Maire, F. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2005. IDEAL 2005. Lecture Notes in Computer Science, vol 3578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508069_48

  • DOI: https://doi.org/10.1007/11508069_48

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26972-4

  • Online ISBN: 978-3-540-31693-0

  • eBook Packages: Computer Science, Computer Science (R0)
