Multi-layer Topology Preserving Mapping for K-Means Clustering

Wu, Ying; Doyle, Thomas K.; Fyfe, Colin

doi:10.1007/978-3-642-23878-9_11

Ying Wu¹⁹,
Thomas K. Doyle¹⁹ &
Colin Fyfe²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6936))

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

1818 Accesses
4 Citations

Abstract

In this paper, we investigate the multi-layer topology preserving mapping for K-means. We present a Multi-layer Topology Preserving Mapping (MTPM) based on the idea of deep architectures. We demonstrate that the MTPM output can be used to discover the number of clusters for K-means and initialize the prototypes of K-means more reasonably. Also, K-means clusters the data based on the discovered underlying structure of the data by the MTPM. The standard wine data set is used to test our algorithm. We finally analyse a real biological data set with no prior clustering information available.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H.: Greedy layer-wise training of deep networks. In: Advances in Neural Information Processing Systems, vol. 19, pp. 153–160. MIT Press, Cambridge (2007)
Google Scholar
Bengio, Y., LeCun, Y.: Large-Scale Kernel Machines. In: Scaling Learning Algorithms towards AI. MIT Press, Cambridge (2007)
Google Scholar
Bishop, C.M., Svensen, M., Williams, C.K.I.: GTM: The generative topographic mapping. Neural Computation 10, 215–234 (1998)
Article MATH Google Scholar
Bottou, L., Bengio, Y.: Convergence properties of the k-means algorithms. In: Advances in Neural Information Processing Systems, vol. 7, pp. 585–592. MIT Press, Cambridge (1995)
Google Scholar
de Boer, P.-T., Kroese, D.P., Mannor, S., Rubenstein, R.Y.: A tutorial on the cross-entropy method. Annals of Operations Research 134(1), 19–67 (2004)
Article MathSciNet MATH Google Scholar
Doyle, T.K., Houghton, J.D.R., O’Suilleabhain, P.F., Hobson, V.J., Marnell, F., Davenport, J., Hays, G.C.: Leatherback turtles satellite tagged in european waters. Endangered Species Research 4, 23–31 (2008)
Article Google Scholar
Fedak, M., Lovell, P., McConnell, B., Hunter, C.: Overcoming the constraints of long range radio telemetry from animals: Getting more useful data from smaller packages. Integrative and Comparative Biology 42(1), 3–10 (2002)
Article Google Scholar
Fyfe, C.: Two topographic maps for data visualization. Data Mining and Kownledge Discovery 14, 207–224 (2007)
Article MathSciNet Google Scholar
Hays, G.C., Houghton, J.D.R., Isaacs, C., King, R.S., Lloyd, C., Lovell, P.: First records of oceanic dive profiles for leatherback turtles, dermochelys coriacea, indicate behavioural plasticity associated with long-distance migration. Animal Behaviour 67(4), 733–743 (2004)
Article Google Scholar
Hinton, G.E.: Training products of experts by minimizing contrastive divergence. Technical Report 2000-004, Gatsby Computational Neuroscience Unit. University College, London (2000)
Google Scholar
Hinton, G.E., Osindero, S., Teh, Y.: A fast learning algorithm for deep belief nets. Neural Computation 16, 1527–1554 (2006)
Article MathSciNet MATH Google Scholar
Hinton, G.E., Salakhutdinov, R.R.: Reducing the demensionality of data with neural networks. Science 313, 504–507 (2006)
Article MathSciNet MATH Google Scholar
Kohonen, T.: Self-organising maps. Springer, Heidelberg (1995)
Book Google Scholar
MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Le Cam, L.M., Neyman, J. (eds.) Proc. of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press, Berkeley (1967)
Google Scholar
Rubinstein, R.Y.: Optimization of computer simulation models with rare events. European Journal of Operations Reasearch 99, 89–112 (1997)
Article Google Scholar
Wu, Y., Fyfe, C.: The on-line cross entropy method for unsupervised data exploration. WSEAS Transactions on Mathematics 6(12), 865–877 (2007)
MathSciNet Google Scholar
Wu, Y., Fyfe, C.: Topology preserving mappings using cross entropy adaptation. In: International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Coastal and Marine Research Centre, ERI, University College Cork Glucksman Marine Facility, Naval Base, Haulbowline, Ireland
Ying Wu & Thomas K. Doyle
Applied Computational Intelligence Research Unit, The University of the West of Scotland, Scotland
Colin Fyfe

Authors

Ying Wu
View author publications
You can also search for this author in PubMed Google Scholar
Thomas K. Doyle
View author publications
You can also search for this author in PubMed Google Scholar
Colin Fyfe
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, University of Manchester, Sackville Street Building, M60 1QD, Manchester, UK
Hujun Yin
School of Computing Sciences, University of East Anglia, NR4 7TJ, Norwich, UK
Wenjia Wang
University of East Anglia, NR4 7TJ, Norwich, UK
Victor Rayward-Smith

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wu, Y., Doyle, T.K., Fyfe, C. (2011). Multi-layer Topology Preserving Mapping for K-Means Clustering. In: Yin, H., Wang, W., Rayward-Smith, V. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2011. IDEAL 2011. Lecture Notes in Computer Science, vol 6936. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23878-9_11

Download citation

DOI: https://doi.org/10.1007/978-3-642-23878-9_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23877-2
Online ISBN: 978-3-642-23878-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics