Abstract
The self-organizing map (SOM) is an automatic data- analysis method. It is widely applied to clustering problems and data exploration in industry, finance, natural sciences, and linguistics. The most extensive applications, exemplified in this paper, can be found in the management of massive textual data bases. The SOM is related to the classical vector quantization (VQ), which is used extensively in digital signal processing and transmission. Like in VQ, the SOM represents a distribution of input data items using a finite set of models. In the SOM, however, these models are automatically associated with the nodes of a regular (usually two-dimensional) grid in an ordered fashion such that more similar models become automatically associated with nodes that are adjacent in the grid, whereas less similar models are situated farther away from each other in the grid. This organization, a kind of similarity diagram of the models, makes it possible to obtain an insight into the topographic relationships of data, especially of high-dimensional data items. If the data items belong to certain predetermined classes, the models (and the nodes) can be calibrated according to these classes. An unknown input item is then classified according to that node, the model of which is most similar with it in some metric used in the construction of the SOM. A new finding introduced in this paper is that an input item can even more accurately be represented by a linear mixture of a few best-matching models. This becomes possible by a least-squares fitting procedure where the coefficients in the linear mixture of models are constrained to nonnegative values.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allinson, N., Yin, H., Allinson, L., Slack, J. (eds.): Advances in Self-Organizing Maps. Springer, London (2001)
Anderberg, M.: Cluster Analysis for Applications. Academic, New York (1973)
Bishop, C.M., Svensen, M., Williams, C.K.I.: Developments of the generative topographic mapping. Neurocomputing 21, 203–224 (1998)
Bishop, C.M., Svensen, M., Williams, C.K.I.: GTM: The generative topographic mapping. Neural Computation 10, 215–234 (1998)
Cheng, Y.: Convergence and ordering of Kohonen’s Batch map. Neural Computation 9, 1667–1676 (1997)
Cottrell, M., Fort, J.C.: Étude d’un processus d’auto-organization. Ann. Inst. Henri Poincaré 23, 1–20 (1987)
Cottrell, M., Fort, J.C., Pagés, G.: Theoretical aspects of the SOM algorithm. In: Proc. WSOM 1997, Workshop on Self-Organizing Maps, Helsinki University of Technology, Neural Networks Research Centre, Espoo, Finland, pp. 246–267 (1997)
Deboeck, G., Kohonen, T. (eds.): Visual Explorations in Finance with Self-Organizing Maps. Springer, London (1998)
Deerwester, S., Dumais, S., Furnas, G., Landauer, K.: Indexing by latent semantic analysis. J. Am. Soc. Inform. Sci. 41, 391–407 (1990)
Dirichlet, G.L.: Über die Reduktion der positiven quadratischen Formen mit drei unbestimmten ganzen Zahlen. J. Reine und Angew. Math. 40, 209–227 (1850)
Fritzke, B.: Growing cell structures - a self-organizing network for unsupervised and supervised learning. Neural Networks 7, 1441–1460 (1994)
Gersho, A.: On the structure of vector quantizers. IEEE Trans. Inform. Theory IT 25, 373–380 (1979)
Gray, R.M.: Vector quantization. IEEE ASSP Mag. 1, 4–29 (1984)
Grenander, U.: Abstract Inference. Wiley, New York (1981)
Hammer, B., Micheli, A., Sperduti, A., Strickert, M.: Recursive self-organizing network models. Neural Networks 17, 1061–1085 (2004)
Hartigan, J.: Clustering Algorithms. Wiley, New York (1975)
Heskes, T.M., Kappen, B.: Error potential for self-organization. In: Proc. ICNN 1993, Int. Conf. on Neural Networks, vol. III, pp. 1219–1223. IEEE Service Center, Piscataway (1993)
Jain, A.K., Dubes, R.C.: Algorithms for Clustering of Data. Prentice-Hall, Englewood Cliffs (1988)
Kaski, S., Kohonen, T.: Exploratory data analysis by the self-organizing map: Structures of welfare and poverty in the world. In: Refenes, A.-P., Abu-Mostafa, Y., Moody, J., Weigand, A. (eds.) Neural Networks in Financial Engineering. Proc. Third Int. Conf. on Neural Networks in the Capital Markets, London, England, October 11-13, 1995, pp. 498–507. World Scientific, Singapore (1996)
Kaski, S.: Dimensionality reduction by random mapping. In: Proc. IJCNN 1998, Intl. Joint Conf. on Neural Networks, pp. 413–418. IEEE Press, Los Alamitos (1998)
Kaski, S., Kangas, J., Kohonen, T.: Bibliography of self-organizing map (SOM) papers: 1981-1997. Neural Computing Surveys 1, 1–176 (1998), (Available in electronic form, pp. 102–350), http://www.cse.ucsc.edu/NCS/vol1.html
Kohonen, T.: Self-organized formation of topologically correct feature maps. Biol. Cyb. 43, 59–69 (1982)
Kohonen, T.: Clustering, taxonomy, and topological maps of patterns. In: Proc. Sixth Int. Conf. on Pattern Recognition, Munich, Germany, pp. 114–128 (1982)
Kohonen, T.: Self-Organization and Associative Memory, 3rd edn. Springer, Heidelberg (1989)
Kohonen, T.: The self-organizing map. Proc. IEEE 78, 1464–1480 (1990)
Kohonen, T.: Self-organizing maps: optimization approaches. In: Kohonen, T., Mäkisara, K., Simula, O., Kangas, J. (eds.) Artificial Neural Networks, vol. II, pp. 981–990. North-Holland, Amsterdam (1991)
Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Heidelberg (2001)
Kohonen, T.: Description of Input Patterns by Linear Mixtures of SOM Models, Report E8. Espoo, Finland: Helsinki University of Technology, Laboratory of Computer and Information Science (2007)
Kohonen, T.: Description of input patterns by linear mixtures of SOM models. In: WSOM 2007 CD-ROM Proceedings, Bielefeld University, Bielefeld, Germany (2007), http://biecoll.ub-bielefeld.de
Kohonen, T., Oja, E., Simula, O., Visa, A., Kangas, J.: Engineering applications of the self-organizing map. Proc. IEEE 84, 1358–1384 (1996)
Kohonen, T., Hynninen, J., Kangas, J., Laaksonen, J.: The Self-Organizing Map Program Package, Report A31. Espoo, Finland: Helsinki University of Technology, Laboratory of Computer and Information Science (1996)
Kohonen, T., Kaski, S., Lagus, K., Salojärvi, J., Honkela, J., Paatero, V., Saarela, A.: Self organization of a massive document collection. IEEE Trans. on Neural Networks 11, 574–585 (2000)
Kohonen, T., Somervuo, P.: How to make large self-organizing maps for nonvectorial data. Neural Networks 15, 945–952 (2002)
Kruskal, J.B., Wish, M.: Multidimensional Scaling. Sage University Paper Series on Quantitative Applications in the Social Sciences No. 07-011. Sage Publications, Newbury Park (1978)
Laaksonen, J., Koskela, M., Oja, E.: PicSOM - self-organizing image retrieval with MPEG-7 content descriptors. IEEE Trans. Neural Networks 13, 841–853 (2002)
Lagus, K., Kaski, S.: Keyword selection method for characterizing text document maps. In: Proc. ICANN 1999, Ninth Int. Conf. on Artificial Neural Networks, vol. 1, pp. 371–376. IEE, London (1999)
Lagus, K., Kaski, S., Kohonen, T.: Mining massive document collections by the WEBSOM method. Inf. Sciences 163, 135–156 (2004)
Lawson, C.L., Hanson, R.J.: Solving Least-Squares Problems. Prentice-Hall, Englewood Cliffs (1974)
Lewis, D.D., Yang, Y., Rose, T.G., Li, T.: RCV1: A new benchmark collection for text categorization research. J. Mach. Learn. Res. 5, 361–397 (2004)
Linde, Y., Buzo, A., Gray, R.M.: An algorithm for vector quantization. IEEE Trans. Communications COM 28(1080), 84–95
Luttrell, S.P.: Technical Report 4669. Malvern, UK: DRA (1992)
Makhoul, J., Roucos, S., Gish, H.: Vector quantization in speech coding. Proc. IEEE PROC-73, 1551–1588 (1985)
Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)
Miikkulainen, R.: Subsymbolic Natural Language Processing: An Integrated Model of Scripts, Lexicon, and Memory. MIT Press, Cambridge (1993)
Miikkulainen, R., Bednar, J.A., Choe, Y., Sirosh, J.: Computational Maps in the Visual Cortex. Springer, New York (2005)
Naim, A., Ratnatunga, K.U., Griffiths, R.E.: Galaxy morphology without classification: Self-organizing maps. Astrophys. J. Suppl. Series 111, 357–367 (1997)
Obermayer, K., Sejnowski, T.: Self-Organizing Map Formation: Foundations of Neural Computation. MIT Press, Cambridge (2001)
Oja, E., Kaski, S. (eds.): Kohonen Maps. Elsevier, Amsterdam (1999)
Oja, M., Kaski, S., Kohonen, T.: Bibliography of self-organizing map (SOM) papers; 1998-2001 addendum. Neural Computing Surveys 3, 1–156 (2003), Available in electronic form at http://www.cse.ucsc.edu/NCS/vol3.html
Oja, M., Somervuo, P., Kaski, S., Kohonen, T.: Clustering of human endogeneous retrovirus sequences with median self-organizing map. In: Proc. WSOM 2003, Workshop on Self-Organizing Maps, Hibikino, Japan (2003)
Pöllä, M., Honkela, T., Kohonen, T.: Bibliography of SOM papers, http://www.cis.hut.fi/research/sombibliography
Ritter, H., Kohonen, T.: Self-organizing semantic maps. Biol. Cyb. 61, 241–254 (1989)
Ritter, H., Martinetz, T., Schulten, K.: Neural Computation and Self-Organizing Maps: An Introduction. Addison-Wesley, Reading (1992)
Robbins, H., Monro, S.: A stochastic approximation method. Ann. Math. Statist. 22, 400–407 (1951)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)
Sammon, J.W.: A nonlinear mapping for data structure analysis. IEEE Trans. Computers C-18, 401–409 (1969)
Seiffert, U., Jain, L.C. (eds.): Self-Organizing Neural Networks: Recent Advances and Applications. Physica-Verlag, Heidelberg (2002)
SOM Japan Co., Ltd.: The ”BLOSSOM” software package, http://www.somj.com
Tokutaka, H., Kishida, S., Fujimura, K.: Application of Self-Organizing Maps - Two-Dimensional Visualization of Multi-Dimensional Information (in Japanese). Kaibundo, Tokyo (1999)
Tokutaka, H., Ookita, M., Fujimura, K.: SOM and the Applications (in Japanese). Springer, Japan (2007)
Tryon, R., Bailey, D.: Cluster Analysis. McGraw-Hill, New York (1973)
Ultsch, A.: Self-organizing neural networks for visualization and classification. In: Opitz, O., Lausen, B., Klar, R. (eds.) Information and Classification, pp. 307–313. Springer, Berlin (1993)
Van Hulle, M.: Faithful Representations and Topographic Maps: From Distoryion- to Information-Based Self-Organization. Wiley, New York (2000)
Vesanto, J., Himberg, J., Alhoniemi, E., Parhankangas, J.: Self-organizing map in Matlab: the SOM Toolbox. In: Proc. Matlab DSP Conference 1999, Espoo, Finland, November 16-17, pp. 35–40 (1999)
Vesanto, J., Alhoniemi, E., Himberg, J., Kiviluoto, K., Parviainen, J.: Self-organizing map for data mining in Matlab: the SOM Toolbox. Simulation News Europe, 25(54) March (1999)
Voronoi, G.: Nouvelles applications des paramétres continus á la théorie des formes quadratiques. J. Reine und Angew. Math. 133, 97–178 (1907)
World Bank: World Development Report 1992. Oxford Univ. Press, New York (1992)
Young, T.Y., Fu, K.S. (eds.): Handbook of Pattern Recognition and Image Processing. Academic, Orlando (1986)
Zador, P.L.: Asymptotic quantization error of continuous signals and the quantization dimension. IEEE Trans. Inform. Theory IT-28, 139–149 (1982)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kohonen, T. (2008). Data Management by Self-Organizing Maps. In: Zurada, J.M., Yen, G.G., Wang, J. (eds) Computational Intelligence: Research Frontiers. WCCI 2008. Lecture Notes in Computer Science, vol 5050. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68860-0_15
Download citation
DOI: https://doi.org/10.1007/978-3-540-68860-0_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68858-7
Online ISBN: 978-3-540-68860-0
eBook Packages: Computer ScienceComputer Science (R0)