Abstract
In this paper, we investigate multi-layer topology preserving mapping for K-means clustering. We present a Multi-layer Topology Preserving Mapping (MTPM) based on the idea of deep architectures. We demonstrate that the MTPM output can be used to discover the number of clusters for K-means and to initialize the K-means prototypes more reasonably. K-means then clusters the data based on the underlying structure discovered by the MTPM. We test the algorithm on the standard wine data set and finally analyse a real biological data set for which no prior clustering information is available.
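The abstract does not say how the MTPM produces its output, so the following is only a minimal sketch of the final step it describes: running standard K-means (Lloyd's iteration) from prototypes supplied by some earlier stage, where the number of prototypes fixes the number of clusters. The function name `kmeans_from_prototypes` and the toy data are assumptions for illustration, not the authors' implementation.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_from_prototypes(data, init_prototypes, max_iter=100):
    """Lloyd's K-means started from caller-supplied prototypes.

    The number of clusters is implied by len(init_prototypes); in the
    setting of the abstract these would come from a mapping stage
    (hypothetical here), not from random initialization.
    """
    prototypes = [list(p) for p in init_prototypes]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest prototype.
        new_labels = [
            min(range(len(prototypes)),
                key=lambda k: squared_distance(x, prototypes[k]))
            for x in data
        ]
        if new_labels == labels:
            break  # assignments are stable, so the prototypes are too
        labels = new_labels
        # Update step: move each prototype to the mean of its members.
        for k in range(len(prototypes)):
            members = [x for x, lab in zip(data, labels) if lab == k]
            if members:  # leave an empty cluster's prototype unchanged
                prototypes[k] = [sum(col) / len(members)
                                 for col in zip(*members)]
    return prototypes, labels


# Toy data: two well-separated groups in the plane (illustrative only).
data = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.2),
        (5.0, 5.1), (5.2, 4.9), (4.9, 5.0)]
init = [(1.0, 1.0), (4.0, 4.0)]  # stand-in for externally supplied prototypes
prototypes, labels = kmeans_from_prototypes(data, init)
print(labels)
```

Because the starting prototypes already sit near the two groups, the iteration settles after a single update; with poorly placed starting points K-means can converge to a worse partition, which is the motivation the abstract gives for informed initialization.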
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, Y., Doyle, T.K., Fyfe, C. (2011). Multi-layer Topology Preserving Mapping for K-Means Clustering. In: Yin, H., Wang, W., Rayward-Smith, V. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2011. IDEAL 2011. Lecture Notes in Computer Science, vol 6936. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23878-9_11
DOI: https://doi.org/10.1007/978-3-642-23878-9_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23877-2
Online ISBN: 978-3-642-23878-9
eBook Packages: Computer Science, Computer Science (R0)