Improved SOM Labeling Methodology for Data Mining Applications

  • Chapter
Soft Computing for Knowledge Discovery and Data Mining

Self-Organizing Maps (SOMs) have been useful for gaining insight into the information content of large volumes of data in various data mining applications. As a special form of neural network, they are attractive as a data mining tool because they can extract information from data with very little user intervention. Learning in self-organizing maps is considered unsupervised because training patterns do not need desired output information supplied by the user. Even so, a trained SOM often has to be labeled before it can be used in many real-world applications. Unfortunately, this labeling phase is usually supervised, since the patterns need accompanying output information that has to be supplied by the user. Because labeled patterns are not always available, and may not even be possible to construct, the supervised nature of the labeling phase restricts the deployment of SOMs across a wider range of potential data mining applications. This work proposes a methodical and semi-automatic SOM labeling procedure that does not require a set of labeled patterns. Instead, nodes in the trained map are clustered, and the subsets of training patterns associated with each node cluster are identified. The salient dimensions of each node cluster, which constitute the basis for labeling each node in the map, are then identified. The effectiveness of the method is demonstrated on a data mining application involving customer profiling based on an international market segmentation study.

Key words: self-organizing maps, neural networks, classification, clustering
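
The three steps named in the abstract (cluster the map nodes, collect the training patterns associated with each node cluster, pick out salient dimensions per cluster) can be illustrated with a small sketch. The abstract does not say which clustering algorithm or saliency criterion the chapter uses, so everything specific here is an assumption made for illustration: plain k-means over the node weight vectors, association of a pattern with a cluster through its best-matching node, and a z-score threshold on the cluster mean. The function names `bmu`, `kmeans`, and `salient_dimensions`, and the `threshold` parameter, are hypothetical.

```python
import math
import random


def bmu(pattern, weights):
    """Index of the node whose weight vector is closest to the pattern."""
    return min(range(len(weights)), key=lambda j: math.dist(pattern, weights[j]))


def kmeans(points, k, iters=20, seed=0):
    """Plain k-means (an assumed choice); returns a cluster index per point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    assign = [0] * len(points)
    for _ in range(iters):
        assign = [
            min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points
        ]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old center if a cluster is empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign


def salient_dimensions(patterns, weights, k, threshold=0.5):
    """Cluster nodes, map patterns to node clusters, flag salient dimensions.

    A dimension is flagged for a cluster when the mean of the cluster's
    patterns differs from the overall mean by at least `threshold`
    standard deviations (an assumed criterion, not the chapter's).
    """
    # Step 1: cluster the nodes of the trained map by their weight vectors.
    node_cluster = kmeans(weights, k)

    # Step 2: associate each unlabeled training pattern with the cluster
    # of its best-matching node.
    pattern_cluster = [node_cluster[bmu(p, weights)] for p in patterns]

    # Step 3: compare per-cluster means with the overall mean, per dimension.
    n = len(patterns)
    dims = range(len(patterns[0]))
    mean = [sum(p[d] for p in patterns) / n for d in dims]
    # A zero standard deviation is replaced by 1.0 to avoid division by zero.
    std = [
        math.sqrt(sum((p[d] - mean[d]) ** 2 for p in patterns) / n) or 1.0
        for d in dims
    ]
    salient = {}
    for c in range(k):
        members = [p for p, pc in zip(patterns, pattern_cluster) if pc == c]
        salient[c] = []
        if not members:
            continue
        for d in dims:
            z = (sum(p[d] for p in members) / len(members) - mean[d]) / std[d]
            if abs(z) >= threshold:
                salient[c].append((d, "high" if z > 0 else "low"))
    return node_cluster, salient
```

Called as `salient_dimensions(patterns, weights, k)`, with `weights` holding one weight vector per node of an already trained map, the sketch returns the cluster index of each node and, per cluster, the dimensions whose values are unusually high or low. Those dimension lists could then serve as the raw material for labels, with a person choosing the final wording, which is one way to read "semi-automatic" in the abstract.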

Copyright information

© 2008 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Azcarraga, A., Hsieh, MH., Pan, SL., Setiono, R. (2008). Improved SOM Labeling Methodology for Data Mining Applications. In: Maimon, O., Rokach, L. (eds) Soft Computing for Knowledge Discovery and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-69935-6_3

  • DOI: https://doi.org/10.1007/978-0-387-69935-6_3

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-69934-9

  • Online ISBN: 978-0-387-69935-6

  • eBook Packages: Computer Science, Computer Science (R0)
