Skip to main content

Developing an Efficient Clique-Based Algorithm for Community Detection in Large Graphs

  • Conference paper
  • First Online:
Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2017 (AISI 2017)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 639))

  • 2945 Accesses

Abstract

Many computer science problems are structured as a network. Mobile, e-mail, social networks (MySpace, Friendster, Facebook, etc.), collaboration networks, and Protein-Protein Interaction (PPI), Gene Regulatory Networks (GRN) and Metabolic Networks (MN) in bioinformatics, are among several applications. Discovering communities in Networks is a recent and critical task in order to understand and model network structures. Several methods exist for community detection, such as modularity, clique, and random walk methods. These methods are somewhat limited because of the time needed to detect communities and their modularity. In this work, a Clique-based Community Detection Algorithm (CCDA) is proposed to overcome time and modularity limitations. The clique method is suitable since it arises in many real-world problems, as in bioinformatics, computational chemistry, and social networks. In definition, clique is a group of individuals who interact with one another and share similar interests. Based on this definition, if one vertex of a clique is assigned to a specific community, all other vertices in this clique often belong to the same community. CCDA develops a simple and fast method to detect maximum clique for specific vertex. In addition, testing is done for the closest neighbor node instead of testing all nodes in the graph. Since neighbor nodes are also sorted in descending order, it contributes to save more execution time. Furthermore, each node will be visited exactly once. To test the performance of CCDA, it is compared with previously proposed community detection algorithms (Louvain, and MACH with DDA-M2), using various datasets: Amazon (262111 nodes/1234877 vertices), DBLP (317080 nodes/1049866 vertices), and LiveJournal (4847571 nodes, 68993773 vertices). Results have proven the efficiency of the proposed method in terms of time performance and modularity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J.: Graphstructure in the web. Comput. Netw. 33, 309–320 (2000)

    Article  Google Scholar 

  2. Mahmoud, H., Masulli, F., Rovetta, S., Russo, G.: Community detection in protein-protein interaction networks using spectral and graph approaches. In: Computational Intelligence Methods for Bioinformatics and Biostatistics. Lecture Notes in Computer Science, vol. 8452. Springer, Cham (2014)

    Google Scholar 

  3. Rajaraman, A., Leskovec, J., Ullman, J.D.: Mining of Massive Datasets (2010)

    Google Scholar 

  4. Newman, M.E.J.: The structure of scientific collaboration networks. Proc. Natl. Acad. Sci. U.S.A. 98, 404–409 (2001)

    Article  MathSciNet  MATH  Google Scholar 

  5. Meyers, R.A. (ed.).: Encyclopedia of Complexity and Systems Science. Springer, Berlin (2009). 978-0-387-30440-3. Fortunato, S., Castellano, C.

    Google Scholar 

  6. Saha, B., Mandal, A., Tripathy, S.B., Mukherjee, D.: Complex networks, communities and clustering: a survey. CoRR, abs/1503.06277 (2015)

    Google Scholar 

  7. Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. EStatistical Nonlinear Soft Matter Phys. 69(2 Pt 2), 116 (2003)

    Google Scholar 

  8. Meyers, R.A. (ed.).: Encyclopedia of Complexity and Systems Science, vol. 1. Springer, Berlin, eprint arXiv:0712.2716 (2009). Fortunato, S., Castellano, C.

  9. Bron, C., Kerbosch, J.: Finding all cliques of an undirected graph. Commun. ACM 16, 575–577 (1973)

    Article  MATH  Google Scholar 

  10. Gao, W., Wong, K.F., Xia, Y., Xu, R.: Clique percolation method for finding naturally cohesive and overlapping document clusters. In: Matsumoto, Y., Sproat, R.W., Wong, K.F., Zhang, M. (eds.) Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead. ICCPOL 2006. Lecture Notes in Computer Science, vol. 4285. Springer, Berlin (2006)

    Google Scholar 

  11. Palsetia, D., Patwary, M.M.A., Hendrix, W., Agrawal, A., Choudhary, A.: Clique guided community detection. In: IEEE International Conference on Big Data (Big Data) (2014)

    Google Scholar 

  12. Blondel, V.D., Guillaume, J.-L.: Renaud Lambiotte and Etienne Lefebvre: Fast unfolding of communities in large networks. arXiv:0803.0476 (2008)

  13. He, K., Li, Y., Soundarajan, S., Hopcroft, J.E.: Hidden Community Detection in Social Networks. eprint arXiv:1702.07462 (2017)

  14. Zachary, W.W.: An information flow model for conflict and fission in small groups. J. Anthropol. Res. 33, 452–473 (1977)

    Article  Google Scholar 

  15. Leskovec, J., Adamic, L., Huberman, B.A.: The dynamics of viral marketing. ACM Trans. Web (ACM TWEB), 1(1) (2007)

    Google Scholar 

  16. Yang, J., Leskovec, J.: Defining and evaluating network communities based on ground-truth. In: ICDM (2012)

    Google Scholar 

  17. Backstrom, L., Huttenlocher, D., Kleinberg, J., Lan, X.: Group formation in large social networks: membership, growth, and evolution. In: KDD (2006)

    Google Scholar 

  18. Leskovec, J., Lang, K., Dasgupta, A., Mahoney, M.: Community structure in large networks: natural cluster sizes and the absence of large well-defined clusters. Int. Math. 6(1), 29–123 (2009)

    MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Taysir Hassan A. Soliman .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Saad, H., Soliman, T.H.A., Rady, S. (2018). Developing an Efficient Clique-Based Algorithm for Community Detection in Large Graphs. In: Hassanien, A., Shaalan, K., Gaber, T., Tolba, M. (eds) Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2017. AISI 2017. Advances in Intelligent Systems and Computing, vol 639. Springer, Cham. https://doi.org/10.1007/978-3-319-64861-3_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-64861-3_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-64860-6

  • Online ISBN: 978-3-319-64861-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics