Skip to main content

S-Rank: A Supervised Ranking Framework for Relationship Prediction in Heterogeneous Information Networks

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9799))

Abstract

The most crucial part for relationship prediction in heterogeneous information networks (HIN) is how to effectively represent and utilize the information hidden in the creation of relationships. There exist three kinds of information that need to be considered, namely local structure information (Local-info), global structure information (Global-info) and attribute information (Attr-info). They influence relationship creation in a different but complementary way: Local-info is limited to the topologies around certain nodes thus it ignores the global position of node; methods using Global-info are biased to highly visible objects; and Attr-info can capture features related to objects and relations in networks. Therefore, it is essential to combine all the three kinds of information together. However, existing approaches utilize them separately or in a partially combined way since effectively encoding all the information together is not an easy task. In this paper, a novel three-phase Supervised Ranking framework (S-Rank) is proposed to tackle this issue. To the best of our knowledge, our work is the first to completely combine Global-info, Local-info and Attr-info together. Firstly, a Supervised PageRank strategy (SPR) is proposed to capture Global-info and Attr-info. Secondly, we propose a Meta Path-based Ranking method (MPR) to obtain Local-info in HIN. Finally, they are integrated into the final ranking result. Experiments on DBLP data demonstrate that the proposed S-Rank framework can effectively take advantage of all the three kinds of information for predicting citation relation and outperforms other well-known baseline approaches.

This work was partially supported by National Science Foundation of China (No. 61272374, No. 61300190 and No. 61572096) and 863 Project (No. 2015AA015463).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://www.informatik.uni-trier.de/~ley/db/.

  2. 2.

    Remind that SRW here is distinct from SRW method mentioned in Sect. 1.2.

  3. 3.

    Available at http://aminer.org/billboard/DBLP_Citation.

References

  1. Backstrom, L., Leskovec, J.: Supervised random walks: predicting and recommending links in social networks. In: The Fourth ACM International Conference on Web Search and Data Mining, pp. 635–644. ACM (2011)

    Google Scholar 

  2. Cao, B., Kong, X., Yu, P.S.: Collective prediction of multiple types of links in heterogeneous information networks. In: ICDM 2014, pp. 50–59 (2014)

    Google Scholar 

  3. Deng, Z.H., Lai, B.Y., Wang, Z.H., Fang, G.D.: PAV: a novel model for ranking heterogeneous objects in bibliographic information networks. Expert Syst. Appl. 39(10), 9788–9796 (2012)

    Article  Google Scholar 

  4. Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. j. mach. learn. res. 9, 1871–1874 (2008)

    MATH  Google Scholar 

  5. Gao, B., Liu, T., Wei, W., Wang, T., Li, H.: Semi-supervised ranking on very large graphs with rich metadata. In: SIGKDD 2011, pp. 96–104 (2011)

    Google Scholar 

  6. Han, J.: Mining heterogeneous information networks: the next frontier. In: SIGKDD, pp. 2–3. ACM (2012)

    Google Scholar 

  7. He, J., Bailey, J., Zhang, R.: Exploiting transitive similarity and temporal dynamics for similarity search in heterogeneous information networks. In: Bhowmick, S.S., Dyreson, C.E., Jensen, C.S., Lee, M.L., Muliantara, A., Thalheim, B. (eds.) DASFAA 2014, Part II. LNCS, vol. 8422, pp. 141–155. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  8. Huang, J., Ling, C.X.: Using AUC and accuracy in evaluating learning algorithms. IEEE Trans. Knowl. Data Eng. 17(3), 299–310 (2005)

    Article  Google Scholar 

  9. Lee, J.B., Adorna, H.: Link prediction in a modified heterogeneous bibliographic network. In: ASONAM, pp. 442–449. IEEE (2012)

    Google Scholar 

  10. Liben-Nowell, D., Kleinberg, J.: The link-prediction problem for social networks. J. Am. Soc. Inf. Sci. Technol. 58(7), 1019–1031 (2007)

    Article  Google Scholar 

  11. Menon, A.K., Elkan, C.: Link prediction via matrix factorization. In: Gunopulos, D., Hofmann, T., Malerba, D., Vazirgiannis, M. (eds.) ECML PKDD 2011, Part II. LNCS, vol. 6912, pp. 437–452. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  12. Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: bringing order to the web (1999)

    Google Scholar 

  13. Rajkumar, A., Agarwal, S.: A statistical convergence perspective of algorithms for rank aggregation from pairwise data. In: ICML 2014, pp. 118–126 (2014)

    Google Scholar 

  14. Shen, W., Han, J., Wang, J.: A probabilistic model for linking named entities in web text with heterogeneous information networks. In: SIGMOD 2014, pp. 1199–1210 (2014)

    Google Scholar 

  15. Sun, Y., Barber, R., Gupta, M., Aggarwal, C.C., Han, J.: Co-author relationship prediction in heterogeneous bibliographic networks. In: ASONAM, pp. 121–128. IEEE (2011)

    Google Scholar 

  16. Sun, Y., Han, J., Aggarwal, C.C., Chawla, N.V.: When will it happen?: relationship prediction in heterogeneous information networks. In: WSDM, pp. 663–672 (2012)

    Google Scholar 

  17. Sun, Y., Han, J., Yan, X., Yu, P.S., Wu, T.: Pathsim: meta path-based top-k similarity search in heterogeneous information networks. PVLDB 4(11), 992–1003 (2011)

    Google Scholar 

  18. Tang, J., Zhang, J., Yao, L., Li, J., Zhang, L., Su, Z.: Arnetminer: extraction and mining of academic social networks. In: KDD, pp. 990–998 (2008)

    Google Scholar 

  19. Yin, Z., Gupta, M., Weninger, T., Han, J.: A unified framework for link recommendation using random walks. In: 2010 International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 152–159. IEEE (2010)

    Google Scholar 

  20. Yu, X., Gu, Q., Zhou, M., Han, J.: Citation prediction in heterogeneous bibliographic networks. In: SDM, pp. 1119–1130. SIAM (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wenxin Liang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Liang, W., He, X., Tang, D., Zhang, X. (2016). S-Rank: A Supervised Ranking Framework for Relationship Prediction in Heterogeneous Information Networks. In: Fujita, H., Ali, M., Selamat, A., Sasaki, J., Kurematsu, M. (eds) Trends in Applied Knowledge-Based Systems and Data Science. IEA/AIE 2016. Lecture Notes in Computer Science(), vol 9799. Springer, Cham. https://doi.org/10.1007/978-3-319-42007-3_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42007-3_26

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42006-6

  • Online ISBN: 978-3-319-42007-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics