Abstract
The Self-Organising Map (SOM) is widely used to classify document collections. Such classifications are usually coarse-grained and cannot support accurate document retrieval. A document classification scheme based on the Multi-level Nested Self-Organising Map (MNSOM) is proposed to solve this problem. An MNSOM consists of a top map and a set of nested maps organised at different levels. The clusters on the top map are relatively general, which favours retrieval recall; the nested maps refine these clusters into more specific groups, which enhances retrieval precision. The MNSOM was tested on a collection of software documents. The experimental results show that the MNSOM significantly improved retrieval performance compared with classification based on a single SOM.
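The nesting idea can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not give map sizes, training schedules, or the rule for deciding when a cluster receives a nested map, so the function names (`train_som`, `best_unit`, `build_mnsom`, `retrieve`), the 2×2 map shape, the `min_docs` threshold, and the linear decay schedule below are all assumptions chosen for illustration. Documents are assumed to be already encoded as numeric vectors.

```python
import math
import random


def best_unit(som, vector):
    """Return the grid position of the unit whose weights are closest to vector."""
    return min(som, key=lambda n: sum((w - x) ** 2 for w, x in zip(som[n], vector)))


def train_som(vectors, rows, cols, epochs=50, lr0=0.5, seed=0):
    """Train a small rectangular SOM; returns {(row, col): weight list}.

    Assumed schedule: learning rate and neighbourhood radius decay linearly.
    """
    rng = random.Random(seed)
    dim = len(vectors[0])
    som = {(r, c): [rng.random() for _ in range(dim)]
           for r in range(rows) for c in range(cols)}
    radius0 = max(rows, cols) / 2.0
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)
        radius = max(radius0 * (1.0 - frac), 0.5)
        for v in rng.sample(vectors, len(vectors)):
            bmu = best_unit(som, v)
            for node, weights in som.items():
                dist2 = (node[0] - bmu[0]) ** 2 + (node[1] - bmu[1]) ** 2
                h = math.exp(-dist2 / (2.0 * radius ** 2))  # Gaussian neighbourhood
                for i in range(dim):
                    weights[i] += lr * h * (v[i] - weights[i])
    return som


def build_mnsom(docs, ids=None, shape=(2, 2), min_docs=4, levels=2):
    """Build a top map, then a nested map for each sufficiently large cluster."""
    ids = list(range(len(docs))) if ids is None else ids
    som = train_som([docs[i] for i in ids], *shape)
    clusters = {}
    for i in ids:
        clusters.setdefault(best_unit(som, docs[i]), []).append(i)
    nested = {}
    if levels > 1:
        for node, members in clusters.items():
            if len(members) >= min_docs:  # hypothetical nesting criterion
                nested[node] = build_mnsom(docs, members, shape, min_docs, levels - 1)
    return {"som": som, "clusters": clusters, "nested": nested}


def retrieve(mnsom, query):
    """Descend from the top map to the most specific cluster matching the query."""
    node = best_unit(mnsom["som"], query)
    while node in mnsom["nested"]:
        mnsom = mnsom["nested"][node]
        node = best_unit(mnsom["som"], query)
    return mnsom["clusters"].get(node, [])
```

In this sketch, stopping at the top map returns the whole general cluster (more documents, favouring recall), while descending through `retrieve` returns a subset of it (fewer documents, favouring precision).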
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
Cite this paper
Ye, H. (2005). Multi-level Document Classifications with Self-organising Maps. In: Gallagher, M., Hogan, J.P., Maire, F. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2005. IDEAL 2005. Lecture Notes in Computer Science, vol 3578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508069_48
DOI: https://doi.org/10.1007/11508069_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26972-4
Online ISBN: 978-3-540-31693-0
eBook Packages: Computer Science, Computer Science (R0)