Data Visualization and Analysis with Self-Organizing Maps in Learning Metrics

Kaski, Samuel; Sinkkonen, Janne; Peltonen, Jaakko

doi:10.1007/3-540-44801-2_17

Samuel Kaski⁷,
Janne Sinkkonen⁷ &
Jaakko Peltonen⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2114))

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

900 Accesses
3 Citations

Abstract

High-dimensional data can be visualized and analyzed with the Self-Organizing Map, a method for clustering data and visualizing it on a lower-dimensional display. Results depend on the (often Euclidean) distance measure of the data space. We introduce an improved metric that emphasizes important local directions by measuring changes in an auxiliary, interesting property of the data points, for example their class. A Self-Organizing Map is computed in the new metric and used for vi- sualizing and clustering the data. The trained map represents directions of highest relevance for the property of interest. In data analysis it is especially beneficial that the importance of the original data variables throughout the data space can be assessed and visualized. We apply the method to analyze the bankruptcy risk of Finnish enterprises.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Amari, S.-I.: Differential-Geometrical Methods in Statistics. Springer, New York (1990)
MATH Google Scholar
Amari, S.-I.: Natural Gradient Works Efficiently in Learning. In: Neural Computation 10 (1998) 251–276
Article Google Scholar
Card, S.K., Mackinlay, J.D., Shneiderman, B. (eds.): Readings in Information Visualization. Using Vision to Think, Morgan Kaufmann, San Francisco, CA (1999)
Google Scholar
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. In: Journal of the Royal Statistical Society, Series B 39 (1977) 1–38
MATH MathSciNet Google Scholar
Fisher, J.W. III, Principe, J.: A Methodology for Information Theoretic Feature Extraction. In: Proc. IJCNN’98, International Joint Conference on Neural Networks, Vol. 3. IEEE Service Center, Piscataway, NJ (1998) 1712–1716
Google Scholar
Hastie, T., Tibshirani, R., Buja, A.: Flexible Discriminant and Mixture Models. In: Kay, J., Titterington, D. (eds.): Proc. Conf. on Neural Networks and Statistics. Oxford University Press (1995)
Google Scholar
Hastie, T., Tibshirani, R.: Discriminant Analysis by Gaussian Mixtures. JRSSB (1996)
Google Scholar
Holmström, L., Koistinen, P., Laaksonen, J., Oja, E.: Neural and Statistical Classifiers Taxonomy and Two Case Studies. In: IEEE Transactions on Neural Networks 8 (1997) 5–17
Article Google Scholar
Hofmann, T.: Learning the Similarity of Documents: an Information-Geometric Approach to Document Retrieval and Categorization. In: Solla, S.A., Leen, T.K., Müller, K.-R. (eds.): Advances in Neural Information Processing Systems 12. MIT Press, Cambridge, MA (2000) 914–920
Google Scholar
Jaakkola, T.S., Haussler, D.: Exploiting Generative Models in Discriminative Classifiers. In: Kearns, Michael S., Solla, Sara A., Cohn, David A. (eds.): Advances in Neural Information Processing Systems 11. Morgan Kauffmann Publishers, San Mateo, CA (1999) 487–493
Google Scholar
Kaski, S., Kangas, J., and Kohonen, T.: Bibliography of Self-Organizing Map (SOM) Papers: 1981-1997. Neural Computing Surveys 1 (1998) 1–176.
Google Scholar
Kass, R.E., Vos, P.W.: Geometrical Foundations of Asymptotic Inference. Wiley, New York (1997)
MATH Google Scholar
Keim, D.A., Kriegel, H.-P.: Visualization techniques for mining large databases: A comparison. IEEE Transactions on Knowledge and Data Engineering 8 (1996) 923–938
Article Google Scholar
Kiviluoto, K.: Predicting Bankruptcies with the Self-Organizing Map. Neurocomputing 21 (1998) 191–201
Article MATH Google Scholar
Kiviluoto, K., Bergius, P.: Exploring Corporate Bankruptcy with Two-Level Self-Organizing Maps. Decision technologies for computational management science. In: Proceedings of Fifth International Conference on Computational Finance. Kluwer Academic Publishers, Boston (1998) 373–380
Google Scholar
Kohonen, T.: Self-organized Formation of Topologically Correct Feature Maps. In: Biological Cybernetics 43 (1982) 59–69
Article MATH MathSciNet Google Scholar
Kohonen, T.: Self-Organizing Maps. Springer, Berlin (1995; second, extended edition 1997)
Google Scholar
Kullback, S.: Information Theory and Statistics. Wiley, New York (1959)
MATH Google Scholar
Murray, M.K., Rice, J.W.: Differential Geometry and Statistics. Chapman & Hall, London (1993)
MATH Google Scholar
Rao, C.R.: Information and the Accuracy Attainable in the Estimation of Statistical Parameters. Bull. Calcutta Math. Soc. 37 (1945) 81–91
MATH MathSciNet Google Scholar
Ripley, B.D.: Pattern Recognition and Neural Networks. Cambridge University Press, Cambridge, UK (1996)
MATH Google Scholar
Torkkola, K., Campbell, W. M.: Mutual Information in Learning Feature Transformations. In: Proc. ICML’2000, The Seventeenth International Conference on Machine Learning (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Neural Networks Research Centre, Helsinki University of Technology, 5400, FIN, 02015, HUT, Finland
Samuel Kaski, Janne Sinkkonen & Jaakko Peltonen

Authors

Samuel Kaski
View author publications
You can also search for this author in PubMed Google Scholar
Janne Sinkkonen
View author publications
You can also search for this author in PubMed Google Scholar
Jaakko Peltonen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Kyoto University, Kyoto, 606-8501, Japan
Yahiko Kambayashi
EC3, Siebensterngasse 21/3, 1070, Wien
Werner Winiwarter
Center for Spatial Information Science (CSIS), University of Tokyo, 4-6-1, Komaba Meguro-ku, Tokyo, 153-8904, Japan
Masatoshi Arikawa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kaski, S., Sinkkonen, J., Peltonen, J. (2001). Data Visualization and Analysis with Self-Organizing Maps in Learning Metrics. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2001. Lecture Notes in Computer Science, vol 2114. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44801-2_17

Download citation

DOI: https://doi.org/10.1007/3-540-44801-2_17
Published: 28 August 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42553-3
Online ISBN: 978-3-540-44801-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics