Abstract
Complex and heterogeneous data sets can often be interpreted as having multiple clustering, each of which are valid but distinct from the others. Several algorithms involving multiple objective functions have been reported for such alternative clusterings. We propose a genetic algorithm based approach for obtaining valid but diverse clustering. A variable length genetic algorithm approach is used to enable varying number of clusters in each interpretation. A suitable method for population initialization and appropriate crossover and mutation operators are also used. Experimental results on benchmark data sets show that the method is comparable with related alternative clustering techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vinh, N.X., Epps, J.: minCEntropy: A Novel information theoretic Approach for the generation of alternative clusterings. In: Proceeding of 10th IEEE Int. Conf. on Data Mining (ICDM 2010), pp. 521–530. IEEE Computer Society (2010)
Dang, X.H., Bailey, J.: Hierarchical Information Theoretic Technique for the Discovery of Non Linear Alternative Clusterings. In: Proceeding of 16th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pp. 573–582. ACM (2010)
Bae, E., Bailey, J.: COALA: A novel approach for the extraction of an alternate clustering of high quality and high dissimilarity. In: Proceeding of 6th International Conference on Data Mining (ICDM 2006), pp. 53–62. IEEE Computer Society (2006)
Cui, Y., Fern, X.Z., Dy, J.G.: Non-redundant Multi-view Clustering via Orthogonalization. In: Proceeding of 7th Int. Conf. on Data Mining (ICDM 2007). pp. 133–142. IEEE Computer Society (2007)
Davidson, I., Qi, Z.: Finding Alternative Clusterings using Constraints. In: Proceeding of 8th Int. Conf. on Data Mining (ICDM 2008). pp. 773–778. IEEE Computer Society (2008)
Maulik, U., Banbyopadhyay, S.: Genetic algorithm-based clustering technique. Pattern Recognition 33(9), 1455–1465 (2000)
Bandyopadhyay, S., Maulik, U.: Nonparametric genetic clustering: comparison of validity indices. IEEE Trans. on Systems, Man, and Cybernetics, Part C: Applications and Reviews. 31(1), 120–125 (2001)
Bandyopadhyay, S., Murthy, C.A., Pal, S.K.: VGA-Classifier: design and applications. IEEE Trans. on Systems, Man, and Cybernetics, Part B: Cybernetics 30(6), 890–895 (2000)
Truong, D.T., Battiti, R.: Cluster-Oriented Genetic Algorithm for Alternative Clustering. In: Proceeding of 12th IEEE Int. Conf. on Data Mining (ICDM 2012), pp. 1122–1127. IEEE Computer Society (2012)
UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Saha, M., Mitra, P. (2014). VLGAAC: Variable Length Genetic Algorithm Based Alternative Clustering. In: Loo, C.K., Yap, K.S., Wong, K.W., Teoh, A., Huang, K. (eds) Neural Information Processing. ICONIP 2014. Lecture Notes in Computer Science, vol 8835. Springer, Cham. https://doi.org/10.1007/978-3-319-12640-1_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-12640-1_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12639-5
Online ISBN: 978-3-319-12640-1
eBook Packages: Computer ScienceComputer Science (R0)