Synonyms
Definition
Many datasets can naturally be represented as graph where nodes represent instances and links represent relationships between those instances. A fundamental problem with these types of data is that the link information in the graph may be of dubious quality; links may incorrectly exist between unrelated nodes and links may be missing between two related nodes. The goal of link prediction is to predict the existence of incorrect or missing links between the nodes of the graph.
Theory/Solution
Inferring the existences of edges between nodes in a graph has traditionally been referred to as link prediction (Liben-Nowell and Kleinberg 2003a; Taskar et al. 2003). Link prediction is a challenging problem that has been studied in various guises in different domains. For example, in social network analysis, there is work on predicting friendship links (Zheleva et al. 2008), event participation links (i.e., coauthorship O’Madadhain...
References
Albert R, DasGupta B, Dondi R, Kachalo S, Sontag E, Zelikovsky A et al (2007) A novel method for signal transduction network inference from indirect experimental evidence. J Comput Biol 14:407–419
Balasubramanyan R, Carvalho VR, Cohen W (2009) Cutonce recipient recommendation and leak detection in action. In: Workshop on enhanced messaging, Chicago
Carvalho VR, Cohen WW (2007) Preventing information leaks in email. In: SIAM conference on data mining, Minneapolis
Chaiwanarom P, Lursinsap C (2008) Link completion using prediction by partial matching. In: International symposium on communications and information technologies, Vientiane
Clauset A, Moore C, Newman MEJ (2008) Hierarchical structure and the prediction of missing links in networks. Nature 453:98
Deng M, Mehta S, Sun F, Chen T (2002) Inferring domain-domain interactions from protein-protein interactions. Genome Res 12(10):1540–1548
Diehl C, Namata GM, Getoor L (2007) Relationship identification for social network discovery. In: Proceedings of the 22nd national conference on artificial intelligence, Vancouver
Farrell S, Campbell C, Myagmar S (2005) Relescope: an experiment in accelerating relationships. In: Extended abstracts on human factors in computing systems, Portland
Getoor L, Friedman N, Koller D, Taskar B (2003) Learning probabilistic models of link structure. Mach Learn 3:679–707
Goldenberg A, Kubica J, Komarek P, Moore A, Schneider J (2003) A comparison of statistical and machine learning algorithms on the task of link completion. In: Conference on knowledge discovery and data mining, workshop on link analysis for detecting complex behavior, Washington, DC
Huang Z, Lin DKJ (2008) The time-series link prediction problem with applications in communication surveillance. Inf J Comput 21:286–303
Huang Z, Zeng DD (2006) A link prediction approach to anomalous email detection. In: IEEE international conference on systems, man, and cybernetics, Taipei
Liben-Nowell D, Kleinberg J (2003a) The link prediction problem for social networks. In: International conference on information and knowledge management, New Orleans
Liben-Nowell and Kleinberg(2003b)
Milne D, Witten IH (2008) Learning to link with wikipedia. In: Proceedings of the 17th ACM conference on information and knowledge management, Napa Valley
O’Madadhain J, Hutchins J, Smyth P (2005) Prediction and ranking algorithms for event-based network data. SIGKDD Explor Newsl 7(2):23–30
Popescul A, Ungar LH (2003) Statistical relational learning for link prediction. In: International joint conferences on artificial intelligence workshop on learning statistical models from relational data
Rattigan MJ, Jensen D (2005a) The case for anomalous link discovery. SIGKDD Explor Newsl 7:41–47
Rattigan and Jensen(2005b)
Richardson M, Domingos P (2006) Markov logic networks. Mach Learn 62:107–136
Spring N, Wetherall D, Anderson T (2004) Reverse engineering the internet. SIGCOMM Comput Commun Rev 34(1):3–8
Sprinzak E, Altuvia Y, Margalit H (2006) Characterization and prediction of protein-protein interactions within and between complexes. Proc Natl Acad Sci 103(40):14718–14723
Szilagyi A, Grimm V, Arakaki AK, Skolnick J (2005a) Prediction of physical protein-protein interactions. Phys Biol 2(2):S1–S16
Szilagyi et al.(2005b)
Taskar B, Wong M-F, Abbeel P, Koller D (2003) Link prediction in relational data. In: Advances in neural information processing systems, Vancouver
Yu H, Paccanaro A, Trifonov V, Gerstein M (2006) Predicting interactions in protein networks by completing defective cliques. Bioinformatics 22(7):823–829
Zheleva E, Getoor L, Golbeck J, Kuter U (2008) Using friendship ties and family circles for link prediction. In: 2nd ACM SIGKDD workshop on social network mining and analysis, Las Vegas
Zhu J (2003) Mining web site link structure for adaptive web site navigation and search. Ph.D. thesis, University of Ulster at Jordanstown
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer Science+Business Media New York
About this entry
Cite this entry
Namata, G., Getoo, L. (2014). Link Prediction. In: Sammut, C., Webb, G. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7502-7_486-1
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7502-7_486-1
Received:
Accepted:
Published:
Publisher Name: Springer, Boston, MA
Online ISBN: 978-1-4899-7502-7
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering