Abstract
Link prediction refers to estimating the possibility of the existence of non-existent links between the nodes. The link prediction algorithms based on local information merely consider nodes’ attributes or a small amount of topology information about common neighbors. In this paper, we proposed a new measure motivated by the cohesion between common neighbors and the predicted nodes——LNL (Local Neighbors Link). Experiments show that, compared with four classical algorithms on seven real networks, LNL has the higher accuracy and robustness. Furthermore, we apply the link prediction algorithms into large-scale networks. We implement the LNL method in both MapReduce and Spark, the experiments show that the implementation by Spark has higher efficiency than using MapReduce.
Keywords
This work is supported in part by the National Key Basic Research and Department (973) Program of China (No. 2013CB329606), and the National Natural Science Foundation of China (No. 71231002, 61375058).
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Getoor, L., Diehl, C.P.: Link mining: a survey. ACM SIGKDD Explor. Newslett. 7(2), 3–12 (2005)
Li, J., Shang, X.Q., Guo, Y., Li, X.Y.: New approach of link prediction in PPI network. Appl. Res. Comput. 11, 016 (2012)
Wu, M.: Research on Relationship recommender systems based on link prediction, Beijing University of Posts and Telecommunications (2012)
Lü, L., Zhou, T.: Link prediction in complex networks: a survey. Physica A 390(6), 1150–1170 (2011)
Guns, R., Rousseau, R.: Recommending research collaborations using link prediction and random forest classifiers. Scientometrics 101(2), 1461–1473 (2014)
Zhang, Q., Shang, M., Lü, L.: Similarity-based classification in partially labeled networks. Int. J. Mod. Phys. C 21(06), 813–824 (2010)
Lü, L., Jin, C.H., Zhou, T.: Similarity index based on local paths for link prediction of complex networks. Phys. Rev. E 80(4), 046122 (2009)
Katz, L.: A new status index derived from sociometric analysis. Psychometrika 18(1), 39–43 (1953)
Liu, W., Lü, L.: Link prediction based on local random walk. EPL (Europhy. Lett.) 89(5), 58007 (2010)
Al Hasan, M., Chaoji, V., Salem, S., Zaki, M.: Link prediction using supervised learning. In: SDM 2006: Workshop on Link Analysis, Counter-terrorism and Security (2006)
Fire, M., Tenenboim, L., Lesser, O., Puzis, R., Rokach, L., Elovici, Y.: Link prediction in social networks using computationally efficient topological features. In: 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), pp. 73–80. IEEE (2011)
Jaccard, P.: Etude comparative de la distribution florale dans une portion des Alpes et du Jura. Impr. Corbaz. (1901)
Adamic, L.A., Adar, E.: Friends and neighbors on the Web. Soc. Netw. 25(3), 211–230 (2003)
Barabási, A., Albert, R.: Emergence of scaling in random networks. Science 286(5439), 509–512 (1999)
Zhou, T., Lü, L., Zhang, Y.C.: Predicting missing links via local information. Eur. Phy. J. B. 71(4), 623–630 (2009)
Dong, Y.X., Ke, Q., Wu, B.: Link prediction based on node similarity. Comput. Sci. 38(7), 162 (2011)
Cannistraci, C.V., Alanis-Lobato, G., Ravasi, T.: From link-prediction in brain connectomes and protein interactomes to the local-community-paradigm in complex networks. Scientific reports, 3 (2013)
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM. 51(1), 107–113 (2008)
Rao, J., Wu, B., Dong, Y.X.: Parallel link prediction in complex network using MapReduce. Ruanjian Xuebao/J. Softw. 23(12), 3175–3186 (2012)
Zaharia, M., Chowdhury, M., Franklin, M. J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, pp. 10–10 (2010)
Watts, D.J., Strogatz, S.H.: Nature 393, 440–442 (1998)
Batagelj, V., Mrvar, A.: Pajek datasets (2006). http://vlado.fmf.uni-lj.si/pub/networks/data/mix/USAir97.net
Adamic, L.A., Glance, N.: The political blogosphere and the 2004 US election: divided they blog. In: Proceedings of the 3rd International Workshop on Link Discovery, pp. 36–43. ACM (2005)
Vladimir Batagelj and Andrej Mrvar (2006): Pajek datasets. http://vlado.fmf.uni-lj.si/pub/networks/data/bio/Yeast/Yeast.htm
Facebook (NIPS) network dataset – {KONECT} (2015)
Hamsterster friendships network dataset – {KONECT} (2015)
Herlocker, J.L., Konstan, J.A., Terveen, L.G., Riedl, J.T.: Evaluating collaborative filtering recommender systems. ACM Trans. Inf. Syst. (TOIS) 22(1), 5–53 (2004)
Hanley, J.A., McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1), 29–36 (1982)
Leskovec J. Stanford large network dataset collection. http://snap.stanford.edu/data/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Yang, J., Yang, L., Zhang, P. (2015). A New Link Prediction Algorithm Based on Local Links. In: Xiao, X., Zhang, Z. (eds) Web-Age Information Management. WAIM 2015. Lecture Notes in Computer Science(), vol 9391. Springer, Cham. https://doi.org/10.1007/978-3-319-23531-8_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-23531-8_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23530-1
Online ISBN: 978-3-319-23531-8
eBook Packages: Computer ScienceComputer Science (R0)