Advertisement

Clustering Ensemble Method for Heterogeneous Partitions

  • Sandro Vega-Pons
  • José Ruiz-Shulcloper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5856)

Abstract

Cluster ensemble is a promising technique for improving the clustering results. An alternative to generate the cluster ensemble is to use different representations of the data and different similarity measures between objects. This way, it is produced a cluster ensemble conformed by heterogeneous partitions obtained with different point of views of the faced problem. This diversity enhances the cluster ensemble but, it restricts the combination process since it makes difficult the use of the original data. In this paper, in order to solve these limitations, we propose a unified representation of the objects taking into account the whole information in the cluster ensemble. This representation allows working with the original data of the problem regardless of the used generation mechanism. Also, this new representation is embedded in the WKF [1] algorithm making a more robust cluster ensemble method. Experimental results with numerical, categorical and mixed datasets show the accuracy of the proposed method.

Keywords

Cluster ensemble object representation similarity measure co-association matrix 

References

  1. 1.
    Vega-Pons, S., Correa-Morris, J., Ruiz-Shulcloper, J.: Weighted cluster ensemble using a kernel consensus function. In: Ruiz-Shulcloper, J., Kropatsch, W.G. (eds.) CIARP 2008. LNCS, vol. 5197, pp. 195–202. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  2. 2.
    Fred, A.L.N., Jain, A.K.: Combining multiple clustering using evidence accumulation. IEEE Trans. on Pat. Analysis and Machine Intelligence 27, 835–850 (2005)CrossRefGoogle Scholar
  3. 3.
    Strehl, A., Ghosh, J.: Cluster ensembles: a knowledge reuse framework for combining multiple partitions. J. Mach. Learn. Res. 3, 583–617 (2002)CrossRefMathSciNetGoogle Scholar
  4. 4.
    Pekalska, E., Duin, R.P.W.: The Dissimilarity Representation for Pattern Recognition: Foundations And Applications. In: Machine Perception and Artificial Intelligence. World Scientific Publishing Co., Singapore (2005)Google Scholar
  5. 5.
    Kuncheva, L., Hadjitodorov, S., Todorova, L.: Experimental comparison of cluster ensemble methods. In: Int. Conference on Information Fusion, pp. 1–7 (2006)Google Scholar
  6. 6.
    Handl, J., Knowles, J., Kell, D.: Computational cluster validation in post- genomic data analysis. Bioinformatics 21, 3201–3212 (2005)CrossRefGoogle Scholar
  7. 7.
    Al-Razgan, M., Domeniconi, C.: Random subspace ensembles for clustering categorical data. Studies in Computational Intelligence (SCI) 126, 31–48 (2008)CrossRefGoogle Scholar
  8. 8.
    UCI machine learning repository, http://archive.ics.uci.edu/ml/datasets.html

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Sandro Vega-Pons
    • 1
  • José Ruiz-Shulcloper
    • 1
  1. 1.Advanced Technologies Application Center (CENATAV)HavanaCuba

Personalised recommendations