Abstract
In recent years, with the rapid growth of network scale, it becomes difficult to detect communities in large-scale networks for many existing algorithms. In this paper, a novel Parallel Multi-Label Propagation Algorithm (PMLPA) is proposed to detect the overlapping communities in networks. PMLPA employs a new label updating strategy using ankle-value in the label propagation procedure during each iteration. The new algorithm is implemented in the Spark framework for its power in distributed parallel computation. Experiments on artificial and real networks show that PMLPA is effective and efficient in community detection in large-scale networks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. Proc. Natl. Acad. Sci. 99(12), 7821–7826 (2002)
Tyler, J.R., Wilkinson, D.M., Huberman, B.A.: E-mail as spectroscopy: automated discovery of community structure within organizations. Inf. Soc. 21(2), 143–153 (2005)
Radicchi, F., Castellano, C., Cecconi, F., et al.: Defining and identifying communities in networks. Proc. Natl. Acad. Sci. U.S.A. 101(9), 2658–2663 (2004)
Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 026113 (2004)
Newman, M.E.J.: Fast algorithm for detecting community structure in networks. Phys. Rev. E 69(6), 066133 (2004)
Blondel, V.D., Guillaume, J.L., Lambiotte, R., et al.: Fast unfolding of communities in large networks. J. Stat. Mech: Theory Exp. 2008(10), P10008 (2008)
Raghavan, U.N., Albert, R., Kumara, S.: Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76(3), 036106 (2007)
Gregory, S.: Finding overlapping communities in networks by label propagation. New J. Phys. 12(10), 103018 (2010)
Wu, Z.H., Lin, Y.F., Gregory, S., et al.: Balanced multi-label propagation for overlapping community detection in social networks. J. Comput. Sci. Technol. 27(3), 468–479 (2012)
Fortunato, S., Barthélemy, M.: Resolution limit in community detection. Proc. Natl. Acad. Sci. 104(1), 36–41 (2007)
Good, B.H., de Montjoye, Y.A., Clauset, A.: Performance of modularity maximization in practical contexts. Phys. Rev. E 81(4), 046106 (2010)
Leung, I.X.Y., Hui, P., Lio, P., et al.: Towards real-time community detection in large networks. Phys. Rev. E 79(6), 066107 (2009)
Zhao, W., Martha, V., Xu, X.: PSCAN: a parallel structural clustering algorithm for big networks in MapReduce. In: 2013 IEEE 27th International Conference on Advanced Information Networking and Applications (AINA), pp. 862–869. IEEE (2013)
Apache Spark. http://spark.apache.org
Zaharia, M., Chowdhury, M., Das, T., et al.: Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. In: Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation, p. 2. USENIX Association (2012)
Huang, J.B., Sun, H.L., Bortner, D., et al.: Mining hierarchical community structure within networks from density-connected traveling orders. J. Softw. (Chinese) 22(5), 951–961 (2011)
Lancichinetti, A., Fortunato, S.: Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Phys. Rev. E 80(1), 016118 (2009)
Leskovec, J., Krevl, A.: SNAP Datasets: Large Network Dataset Collection (2014)
Lancichinetti, A., Fortunato, S., Kertész, J.: Detecting the overlapping and hierarchical community structure in complex networks. New J. Phys. 11(3), 033015 (2009)
Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 026113 (2004)
Nicosia, V., Mangioni, G., Carchiolo, V., et al.: Extending the definition of modularity to directed graphs with overlapping communities. J. Stat. Mech: Theory Exp. 2009(03), P03024 (2009)
Yang, J., Leskovec, J.: Defining and evaluating network communities based on ground-truth. Knowl. Inf. Syst. 42(1), 181–213 (2015)
Acknowledgments
This work is partly supported by the National Natural Science Foundation of China under Grants No. 61103175 and No. 61300104, the Key Project of Chinese Ministry of Education under Grant No. 212086, the Fujian Province High School Science Fund for Distinguished Young Scholars under Grand No. JA12016, the Program for New Century Excellent Talents in Fujian Province University under Grant No. JA13021, the Fujian Natural Science Funds for Distinguished Young Scholar under Grant No. 2014J06017, and the Natural Science Foundation of Fujian Province under Grant No. 2013J01230.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Li, R., Guo, W., Guo, K., Qiu, Q. (2015). Parallel Multi-label Propagation for Overlapping Community Detection in Large-Scale Networks. In: Bikakis, A., Zheng, X. (eds) Multi-disciplinary Trends in Artificial Intelligence. MIWAI 2015. Lecture Notes in Computer Science(), vol 9426. Springer, Cham. https://doi.org/10.1007/978-3-319-26181-2_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-26181-2_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26180-5
Online ISBN: 978-3-319-26181-2
eBook Packages: Computer ScienceComputer Science (R0)