Abstract
Clustering is used as an important technique to extract patterns from big data in various fields. Graph clustering as a subset of clustering has a lot of practical applications. Due to the NP-hardness of the graph clustering problem, many evolutionary algorithms, particularly the genetic algorithm have been presented. One of the most effective operators on the performance of the genetic algorithm is how to represent the solutions of a problem (i.e. encoding). The number of possible partitions of a graph is equal to Bell Number. In the literature, three encoding methods have been presented for graph clustering problem. The number of partitions that these encodings can generate is more than the Bell Number; which indicates that these methods generate a large number of same and iterative solutions which makes the speed of obtaining the solution unacceptable and leads to this fact that the good space search encounters a problem. To overcome this drawback, in this paper we present a new encoding method for graph clustering problem where the number of the generated solutions by this encoding is exactly equal to the Bell numbers. The initial results of our experiments represent that the quality of the obtained solutions by the new encoding is promising.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Mahdavi, M., Chehreghani, M.H., Abolhassani, H., Forsati, R.: Novel meta-heuristic algorithms for clustering web documents. Appl. Math. Comput. 201(1–2), 441–451 (2008)
Pizzuti, C., Rombo, S.E.: Algorithms and tools for protein-protein interaction networks clustering, with a special focus on population-based stochastic methods. Bioinformatics 30(10), 1343–1352 (2014)
Said, A., Abbasi, R.A., Maqbool, O., Daud, A., Aljohani, N.R.: CC-GA: a clustering coefficient based genetic algorithm for detecting communities in social networks. Appl. Soft Comput. 63, 59–70 (2018)
Krapivin, M., Marchese, M., Casati, F.: Exploring and understanding scientific metrics in citation networks. In: Zhou, J. (ed.) Complex 2009. LNICST, vol. 5, pp. 1550–1563. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-02469-6_35
Tumminello, M., Lillo, F., Piilo, J., Mantegna, R.N.: Identification of clusters of investors from their real trading activity in a financial market. New J. Phys. 14(1), 013041 (2012)
Beck, F., Diehl, S.: On the impact of software evolution on software clustering. Empir. Softw. Eng. 18(5), 970–1004 (2013)
Duncan, B., Peele, R.: Bell and Stirling numbers for graphs. J. Integer Seq. 12(09.7), 1 (2009)
Schulz, C.: Graph partitioning and graph clustering in theory and practice. Institute for Theoretical Informatics Karlsruhe Institute of Technology (KIT), pp. 24–187, 20 May 2016
Palmer, C.C., Kershenbaum, A.: Representing trees in genetic algorithms. IBM Thomas J. Watson Research Division (1994)
Mitchell, B.S., Mancoridis, S.: On the automatic modularization of software systems using the bunch tool. IEEE Trans. Softw. Eng. 32(3), 193–208 (2006)
Praditwong, K., Harman, M., Yao, X.: Software module clustering as a multi-objective search problem. IEEE Trans. Softw. Eng. 37(2), 264–282 (2011)
Parsa, S., Bushehrian, O.: A new encoding scheme and a framework to investigate genetic clustering algorithms. J. Res. Pract. Inf. Technol. 37(1), 127 (2005)
Isazadeh, A., Izadkhah, H., Elgedawy, I.: Source Code Modularization: Theory and Techniques. Springer, Heidelberg (2017). https://doi.org/10.1007/978-3-319-63346-6
Tajgardan, M., Izadkhah, H., Lotfi, S.: Software systems clustering using estimation of distribution approach. J. Appl. Comput. Sci. Methods 8(2), 99–113 (2016)
Izadkhah, H., Elgedawy, I., Isazadeh, A.: E-CDGM: an evolutionary call-dependency graph modularization approach for software systems. Cybern. Inf. Technol. 16(3), 70–90 (2016)
Mitchell, B.S.: A heuristic search approach to solving the software clustering problem. Ph.D. theses. Drexel University (2002)
Mahdavi, K.: A clustering genetic algorithm for software modularisation with a multiple hill climbing approach. Diss, Brunel University (2005)
Huang, J., Liu, J.: A similarity-based modularization quality measure for software module clustering problems. Inf. Sci. 342, 96–110 (2016)
Rajapati, A., Chhabra, J.K.: A particle swarm optimization-based heuristic for software module clustering problem. Arab. J. Sci. Eng. 43(12), 7083–7094 (2018)
Chhabra, J.K.: Harmony search based remodularization for object-oriented software systems. Comput. Lang. Syst. Struct. 47, 153–169 (2017)
Chhabra, J.K.: Many-objective artificial bee colony algorithm for large-scale software module clustering problem. Soft Comput. 22(19), 6341–6361 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Tabrizi, A.H.F., Izadkhah, H. (2019). A New Encoding Method for Graph Clustering Problem. In: Grandinetti, L., Mirtaheri, S., Shahbazian, R. (eds) High-Performance Computing and Big Data Analysis. TopHPC 2019. Communications in Computer and Information Science, vol 891. Springer, Cham. https://doi.org/10.1007/978-3-030-33495-6_26
Download citation
DOI: https://doi.org/10.1007/978-3-030-33495-6_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33494-9
Online ISBN: 978-3-030-33495-6
eBook Packages: Computer ScienceComputer Science (R0)