Skip to main content

Data Management by Self-Organizing Maps

  • Chapter
Computational Intelligence: Research Frontiers (WCCI 2008)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5050))

Included in the following conference series:

Abstract

The self-organizing map (SOM) is an automatic data- analysis method. It is widely applied to clustering problems and data exploration in industry, finance, natural sciences, and linguistics. The most extensive applications, exemplified in this paper, can be found in the management of massive textual data bases. The SOM is related to the classical vector quantization (VQ), which is used extensively in digital signal processing and transmission. Like in VQ, the SOM represents a distribution of input data items using a finite set of models. In the SOM, however, these models are automatically associated with the nodes of a regular (usually two-dimensional) grid in an ordered fashion such that more similar models become automatically associated with nodes that are adjacent in the grid, whereas less similar models are situated farther away from each other in the grid. This organization, a kind of similarity diagram of the models, makes it possible to obtain an insight into the topographic relationships of data, especially of high-dimensional data items. If the data items belong to certain predetermined classes, the models (and the nodes) can be calibrated according to these classes. An unknown input item is then classified according to that node, the model of which is most similar with it in some metric used in the construction of the SOM. A new finding introduced in this paper is that an input item can even more accurately be represented by a linear mixture of a few best-matching models. This becomes possible by a least-squares fitting procedure where the coefficients in the linear mixture of models are constrained to nonnegative values.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Allinson, N., Yin, H., Allinson, L., Slack, J. (eds.): Advances in Self-Organizing Maps. Springer, London (2001)

    Google Scholar 

  2. Anderberg, M.: Cluster Analysis for Applications. Academic, New York (1973)

    MATH  Google Scholar 

  3. Bishop, C.M., Svensen, M., Williams, C.K.I.: Developments of the generative topographic mapping. Neurocomputing 21, 203–224 (1998)

    Article  MATH  Google Scholar 

  4. Bishop, C.M., Svensen, M., Williams, C.K.I.: GTM: The generative topographic mapping. Neural Computation 10, 215–234 (1998)

    Article  Google Scholar 

  5. Cheng, Y.: Convergence and ordering of Kohonen’s Batch map. Neural Computation 9, 1667–1676 (1997)

    Article  Google Scholar 

  6. Cottrell, M., Fort, J.C.: Étude d’un processus d’auto-organization. Ann. Inst. Henri Poincaré 23, 1–20 (1987)

    MATH  MathSciNet  Google Scholar 

  7. Cottrell, M., Fort, J.C., Pagés, G.: Theoretical aspects of the SOM algorithm. In: Proc. WSOM 1997, Workshop on Self-Organizing Maps, Helsinki University of Technology, Neural Networks Research Centre, Espoo, Finland, pp. 246–267 (1997)

    Google Scholar 

  8. Deboeck, G., Kohonen, T. (eds.): Visual Explorations in Finance with Self-Organizing Maps. Springer, London (1998)

    MATH  Google Scholar 

  9. Deerwester, S., Dumais, S., Furnas, G., Landauer, K.: Indexing by latent semantic analysis. J. Am. Soc. Inform. Sci. 41, 391–407 (1990)

    Article  Google Scholar 

  10. Dirichlet, G.L.: Über die Reduktion der positiven quadratischen Formen mit drei unbestimmten ganzen Zahlen. J. Reine und Angew. Math. 40, 209–227 (1850)

    MATH  Google Scholar 

  11. Fritzke, B.: Growing cell structures - a self-organizing network for unsupervised and supervised learning. Neural Networks 7, 1441–1460 (1994)

    Article  Google Scholar 

  12. Gersho, A.: On the structure of vector quantizers. IEEE Trans. Inform. Theory IT 25, 373–380 (1979)

    Article  MATH  MathSciNet  Google Scholar 

  13. Gray, R.M.: Vector quantization. IEEE ASSP Mag. 1, 4–29 (1984)

    Google Scholar 

  14. Grenander, U.: Abstract Inference. Wiley, New York (1981)

    MATH  Google Scholar 

  15. Hammer, B., Micheli, A., Sperduti, A., Strickert, M.: Recursive self-organizing network models. Neural Networks 17, 1061–1085 (2004)

    Article  MATH  Google Scholar 

  16. Hartigan, J.: Clustering Algorithms. Wiley, New York (1975)

    MATH  Google Scholar 

  17. Heskes, T.M., Kappen, B.: Error potential for self-organization. In: Proc. ICNN 1993, Int. Conf. on Neural Networks, vol. III, pp. 1219–1223. IEEE Service Center, Piscataway (1993)

    Google Scholar 

  18. Jain, A.K., Dubes, R.C.: Algorithms for Clustering of Data. Prentice-Hall, Englewood Cliffs (1988)

    Google Scholar 

  19. Kaski, S., Kohonen, T.: Exploratory data analysis by the self-organizing map: Structures of welfare and poverty in the world. In: Refenes, A.-P., Abu-Mostafa, Y., Moody, J., Weigand, A. (eds.) Neural Networks in Financial Engineering. Proc. Third Int. Conf. on Neural Networks in the Capital Markets, London, England, October 11-13, 1995, pp. 498–507. World Scientific, Singapore (1996)

    Google Scholar 

  20. Kaski, S.: Dimensionality reduction by random mapping. In: Proc. IJCNN 1998, Intl. Joint Conf. on Neural Networks, pp. 413–418. IEEE Press, Los Alamitos (1998)

    Google Scholar 

  21. Kaski, S., Kangas, J., Kohonen, T.: Bibliography of self-organizing map (SOM) papers: 1981-1997. Neural Computing Surveys 1, 1–176 (1998), (Available in electronic form, pp. 102–350), http://www.cse.ucsc.edu/NCS/vol1.html

  22. Kohonen, T.: Self-organized formation of topologically correct feature maps. Biol. Cyb. 43, 59–69 (1982)

    Article  MATH  MathSciNet  Google Scholar 

  23. Kohonen, T.: Clustering, taxonomy, and topological maps of patterns. In: Proc. Sixth Int. Conf. on Pattern Recognition, Munich, Germany, pp. 114–128 (1982)

    Google Scholar 

  24. Kohonen, T.: Self-Organization and Associative Memory, 3rd edn. Springer, Heidelberg (1989)

    Google Scholar 

  25. Kohonen, T.: The self-organizing map. Proc. IEEE 78, 1464–1480 (1990)

    Article  Google Scholar 

  26. Kohonen, T.: Self-organizing maps: optimization approaches. In: Kohonen, T., Mäkisara, K., Simula, O., Kangas, J. (eds.) Artificial Neural Networks, vol. II, pp. 981–990. North-Holland, Amsterdam (1991)

    Google Scholar 

  27. Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Heidelberg (2001)

    MATH  Google Scholar 

  28. Kohonen, T.: Description of Input Patterns by Linear Mixtures of SOM Models, Report E8. Espoo, Finland: Helsinki University of Technology, Laboratory of Computer and Information Science (2007)

    Google Scholar 

  29. Kohonen, T.: Description of input patterns by linear mixtures of SOM models. In: WSOM 2007 CD-ROM Proceedings, Bielefeld University, Bielefeld, Germany (2007), http://biecoll.ub-bielefeld.de

  30. Kohonen, T., Oja, E., Simula, O., Visa, A., Kangas, J.: Engineering applications of the self-organizing map. Proc. IEEE 84, 1358–1384 (1996)

    Article  Google Scholar 

  31. Kohonen, T., Hynninen, J., Kangas, J., Laaksonen, J.: The Self-Organizing Map Program Package, Report A31. Espoo, Finland: Helsinki University of Technology, Laboratory of Computer and Information Science (1996)

    Google Scholar 

  32. Kohonen, T., Kaski, S., Lagus, K., Salojärvi, J., Honkela, J., Paatero, V., Saarela, A.: Self organization of a massive document collection. IEEE Trans. on Neural Networks 11, 574–585 (2000)

    Article  Google Scholar 

  33. Kohonen, T., Somervuo, P.: How to make large self-organizing maps for nonvectorial data. Neural Networks 15, 945–952 (2002)

    Article  Google Scholar 

  34. Kruskal, J.B., Wish, M.: Multidimensional Scaling. Sage University Paper Series on Quantitative Applications in the Social Sciences No. 07-011. Sage Publications, Newbury Park (1978)

    Google Scholar 

  35. Laaksonen, J., Koskela, M., Oja, E.: PicSOM - self-organizing image retrieval with MPEG-7 content descriptors. IEEE Trans. Neural Networks 13, 841–853 (2002)

    Article  Google Scholar 

  36. Lagus, K., Kaski, S.: Keyword selection method for characterizing text document maps. In: Proc. ICANN 1999, Ninth Int. Conf. on Artificial Neural Networks, vol. 1, pp. 371–376. IEE, London (1999)

    Chapter  Google Scholar 

  37. Lagus, K., Kaski, S., Kohonen, T.: Mining massive document collections by the WEBSOM method. Inf. Sciences 163, 135–156 (2004)

    Article  Google Scholar 

  38. Lawson, C.L., Hanson, R.J.: Solving Least-Squares Problems. Prentice-Hall, Englewood Cliffs (1974)

    MATH  Google Scholar 

  39. Lewis, D.D., Yang, Y., Rose, T.G., Li, T.: RCV1: A new benchmark collection for text categorization research. J. Mach. Learn. Res. 5, 361–397 (2004)

    Google Scholar 

  40. Linde, Y., Buzo, A., Gray, R.M.: An algorithm for vector quantization. IEEE Trans. Communications COM 28(1080), 84–95

    Google Scholar 

  41. Luttrell, S.P.: Technical Report 4669. Malvern, UK: DRA (1992)

    Google Scholar 

  42. Makhoul, J., Roucos, S., Gish, H.: Vector quantization in speech coding. Proc. IEEE PROC-73, 1551–1588 (1985)

    Article  Google Scholar 

  43. Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)

    MATH  Google Scholar 

  44. Miikkulainen, R.: Subsymbolic Natural Language Processing: An Integrated Model of Scripts, Lexicon, and Memory. MIT Press, Cambridge (1993)

    Google Scholar 

  45. Miikkulainen, R., Bednar, J.A., Choe, Y., Sirosh, J.: Computational Maps in the Visual Cortex. Springer, New York (2005)

    Google Scholar 

  46. Naim, A., Ratnatunga, K.U., Griffiths, R.E.: Galaxy morphology without classification: Self-organizing maps. Astrophys. J. Suppl. Series 111, 357–367 (1997)

    Article  Google Scholar 

  47. Obermayer, K., Sejnowski, T.: Self-Organizing Map Formation: Foundations of Neural Computation. MIT Press, Cambridge (2001)

    Google Scholar 

  48. Oja, E., Kaski, S. (eds.): Kohonen Maps. Elsevier, Amsterdam (1999)

    MATH  Google Scholar 

  49. Oja, M., Kaski, S., Kohonen, T.: Bibliography of self-organizing map (SOM) papers; 1998-2001 addendum. Neural Computing Surveys 3, 1–156 (2003), Available in electronic form at http://www.cse.ucsc.edu/NCS/vol3.html

    Google Scholar 

  50. Oja, M., Somervuo, P., Kaski, S., Kohonen, T.: Clustering of human endogeneous retrovirus sequences with median self-organizing map. In: Proc. WSOM 2003, Workshop on Self-Organizing Maps, Hibikino, Japan (2003)

    Google Scholar 

  51. Pöllä, M., Honkela, T., Kohonen, T.: Bibliography of SOM papers, http://www.cis.hut.fi/research/sombibliography

  52. Ritter, H., Kohonen, T.: Self-organizing semantic maps. Biol. Cyb. 61, 241–254 (1989)

    Article  Google Scholar 

  53. Ritter, H., Martinetz, T., Schulten, K.: Neural Computation and Self-Organizing Maps: An Introduction. Addison-Wesley, Reading (1992)

    MATH  Google Scholar 

  54. Robbins, H., Monro, S.: A stochastic approximation method. Ann. Math. Statist. 22, 400–407 (1951)

    Article  MathSciNet  MATH  Google Scholar 

  55. Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)

    MATH  Google Scholar 

  56. Sammon, J.W.: A nonlinear mapping for data structure analysis. IEEE Trans. Computers C-18, 401–409 (1969)

    Article  Google Scholar 

  57. Seiffert, U., Jain, L.C. (eds.): Self-Organizing Neural Networks: Recent Advances and Applications. Physica-Verlag, Heidelberg (2002)

    Google Scholar 

  58. SOM Japan Co., Ltd.: The ”BLOSSOM” software package, http://www.somj.com

  59. Tokutaka, H., Kishida, S., Fujimura, K.: Application of Self-Organizing Maps - Two-Dimensional Visualization of Multi-Dimensional Information (in Japanese). Kaibundo, Tokyo (1999)

    Google Scholar 

  60. Tokutaka, H., Ookita, M., Fujimura, K.: SOM and the Applications (in Japanese). Springer, Japan (2007)

    Google Scholar 

  61. Tryon, R., Bailey, D.: Cluster Analysis. McGraw-Hill, New York (1973)

    Google Scholar 

  62. Ultsch, A.: Self-organizing neural networks for visualization and classification. In: Opitz, O., Lausen, B., Klar, R. (eds.) Information and Classification, pp. 307–313. Springer, Berlin (1993)

    Google Scholar 

  63. Van Hulle, M.: Faithful Representations and Topographic Maps: From Distoryion- to Information-Based Self-Organization. Wiley, New York (2000)

    Google Scholar 

  64. Vesanto, J., Himberg, J., Alhoniemi, E., Parhankangas, J.: Self-organizing map in Matlab: the SOM Toolbox. In: Proc. Matlab DSP Conference 1999, Espoo, Finland, November 16-17, pp. 35–40 (1999)

    Google Scholar 

  65. Vesanto, J., Alhoniemi, E., Himberg, J., Kiviluoto, K., Parviainen, J.: Self-organizing map for data mining in Matlab: the SOM Toolbox. Simulation News Europe, 25(54) March (1999)

    Google Scholar 

  66. Voronoi, G.: Nouvelles applications des paramétres continus á la théorie des formes quadratiques. J. Reine und Angew. Math. 133, 97–178 (1907)

    Google Scholar 

  67. World Bank: World Development Report 1992. Oxford Univ. Press, New York (1992)

    Google Scholar 

  68. Young, T.Y., Fu, K.S. (eds.): Handbook of Pattern Recognition and Image Processing. Academic, Orlando (1986)

    MATH  Google Scholar 

  69. Zador, P.L.: Asymptotic quantization error of continuous signals and the quantization dimension. IEEE Trans. Inform. Theory IT-28, 139–149 (1982)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Jacek M. Zurada Gary G. Yen Jun Wang

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Kohonen, T. (2008). Data Management by Self-Organizing Maps. In: Zurada, J.M., Yen, G.G., Wang, J. (eds) Computational Intelligence: Research Frontiers. WCCI 2008. Lecture Notes in Computer Science, vol 5050. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68860-0_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68860-0_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68858-7

  • Online ISBN: 978-3-540-68860-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics