Skip to main content

Data Visualization and Analysis with Self-Organizing Maps in Learning Metrics

  • Conference paper
  • First Online:
Data Warehousing and Knowledge Discovery (DaWaK 2001)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2114))

Included in the following conference series:

Abstract

High-dimensional data can be visualized and analyzed with the Self-Organizing Map, a method for clustering data and visualizing it on a lower-dimensional display. Results depend on the (often Euclidean) distance measure of the data space. We introduce an improved metric that emphasizes important local directions by measuring changes in an auxiliary, interesting property of the data points, for example their class. A Self-Organizing Map is computed in the new metric and used for vi- sualizing and clustering the data. The trained map represents directions of highest relevance for the property of interest. In data analysis it is especially beneficial that the importance of the original data variables throughout the data space can be assessed and visualized. We apply the method to analyze the bankruptcy risk of Finnish enterprises.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Amari, S.-I.: Differential-Geometrical Methods in Statistics. Springer, New York (1990)

    MATH  Google Scholar 

  2. Amari, S.-I.: Natural Gradient Works Efficiently in Learning. In: Neural Computation 10 (1998) 251–276

    Article  Google Scholar 

  3. Card, S.K., Mackinlay, J.D., Shneiderman, B. (eds.): Readings in Information Visualization. Using Vision to Think, Morgan Kaufmann, San Francisco, CA (1999)

    Google Scholar 

  4. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. In: Journal of the Royal Statistical Society, Series B 39 (1977) 1–38

    MATH  MathSciNet  Google Scholar 

  5. Fisher, J.W. III, Principe, J.: A Methodology for Information Theoretic Feature Extraction. In: Proc. IJCNN’98, International Joint Conference on Neural Networks, Vol. 3. IEEE Service Center, Piscataway, NJ (1998) 1712–1716

    Google Scholar 

  6. Hastie, T., Tibshirani, R., Buja, A.: Flexible Discriminant and Mixture Models. In: Kay, J., Titterington, D. (eds.): Proc. Conf. on Neural Networks and Statistics. Oxford University Press (1995)

    Google Scholar 

  7. Hastie, T., Tibshirani, R.: Discriminant Analysis by Gaussian Mixtures. JRSSB (1996)

    Google Scholar 

  8. Holmström, L., Koistinen, P., Laaksonen, J., Oja, E.: Neural and Statistical Classifiers Taxonomy and Two Case Studies. In: IEEE Transactions on Neural Networks 8 (1997) 5–17

    Article  Google Scholar 

  9. Hofmann, T.: Learning the Similarity of Documents: an Information-Geometric Approach to Document Retrieval and Categorization. In: Solla, S.A., Leen, T.K., Müller, K.-R. (eds.): Advances in Neural Information Processing Systems 12. MIT Press, Cambridge, MA (2000) 914–920

    Google Scholar 

  10. Jaakkola, T.S., Haussler, D.: Exploiting Generative Models in Discriminative Classifiers. In: Kearns, Michael S., Solla, Sara A., Cohn, David A. (eds.): Advances in Neural Information Processing Systems 11. Morgan Kauffmann Publishers, San Mateo, CA (1999) 487–493

    Google Scholar 

  11. Kaski, S., Kangas, J., and Kohonen, T.: Bibliography of Self-Organizing Map (SOM) Papers: 1981-1997. Neural Computing Surveys 1 (1998) 1–176.

    Google Scholar 

  12. Kass, R.E., Vos, P.W.: Geometrical Foundations of Asymptotic Inference. Wiley, New York (1997)

    MATH  Google Scholar 

  13. Keim, D.A., Kriegel, H.-P.: Visualization techniques for mining large databases: A comparison. IEEE Transactions on Knowledge and Data Engineering 8 (1996) 923–938

    Article  Google Scholar 

  14. Kiviluoto, K.: Predicting Bankruptcies with the Self-Organizing Map. Neurocomputing 21 (1998) 191–201

    Article  MATH  Google Scholar 

  15. Kiviluoto, K., Bergius, P.: Exploring Corporate Bankruptcy with Two-Level Self-Organizing Maps. Decision technologies for computational management science. In: Proceedings of Fifth International Conference on Computational Finance. Kluwer Academic Publishers, Boston (1998) 373–380

    Google Scholar 

  16. Kohonen, T.: Self-organized Formation of Topologically Correct Feature Maps. In: Biological Cybernetics 43 (1982) 59–69

    Article  MATH  MathSciNet  Google Scholar 

  17. Kohonen, T.: Self-Organizing Maps. Springer, Berlin (1995; second, extended edition 1997)

    Google Scholar 

  18. Kullback, S.: Information Theory and Statistics. Wiley, New York (1959)

    MATH  Google Scholar 

  19. Murray, M.K., Rice, J.W.: Differential Geometry and Statistics. Chapman & Hall, London (1993)

    MATH  Google Scholar 

  20. Rao, C.R.: Information and the Accuracy Attainable in the Estimation of Statistical Parameters. Bull. Calcutta Math. Soc. 37 (1945) 81–91

    MATH  MathSciNet  Google Scholar 

  21. Ripley, B.D.: Pattern Recognition and Neural Networks. Cambridge University Press, Cambridge, UK (1996)

    MATH  Google Scholar 

  22. Torkkola, K., Campbell, W. M.: Mutual Information in Learning Feature Transformations. In: Proc. ICML’2000, The Seventeenth International Conference on Machine Learning (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kaski, S., Sinkkonen, J., Peltonen, J. (2001). Data Visualization and Analysis with Self-Organizing Maps in Learning Metrics. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2001. Lecture Notes in Computer Science, vol 2114. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44801-2_17

Download citation

  • DOI: https://doi.org/10.1007/3-540-44801-2_17

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42553-3

  • Online ISBN: 978-3-540-44801-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics