Fast Sequence-Based Embedding with Diffusion Graphs

  • Benedek Rozemberczki
  • Rik Sarkar
Conference paper
Part of the Springer Proceedings in Complexity book series (SPCOM)


A graph embedding is a representation of graph vertices in a low-dimensional space that approximately preserves properties such as distances between nodes. Vertex sequence-based embedding procedures extract features from linear sequences of nodes and use a neural network to create embeddings from them. In this paper, we propose diffusion graphs as a method to rapidly generate vertex sequences for network embedding. The method is computationally more efficient than previous methods because its sequence generation is simpler, and it produces more accurate results. In experiments, we found that its performance relative to other methods improves with increasing edge density in the graph. In a community detection task, clustering nodes in the embedding space produces better results than other sequence-based embedding methods.
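The abstract does not spell out how a diffusion graph yields a vertex sequence, so the following is only a minimal sketch under stated assumptions: a diffusion-like process grows a random subtree around a start node, and an Euler tour of that subtree (with every edge traversed in both directions) serves as the vertex sequence. The function names `diffusion_tree` and `euler_sequence`, the `size` parameter, and the toy graph are all hypothetical and are not taken from the paper.

```python
import random


def diffusion_tree(adj, start, size, rng):
    """Grow a random subtree of up to `size` nodes by simulated diffusion.

    Assumption: at each step a random already-reached node passes the
    diffusion on to one random neighbour that has not been reached yet.
    """
    seen = {start}
    active = [start]          # reached nodes that may still have fresh neighbours
    tree = {start: []}        # adjacency lists of the growing subtree
    while active and len(seen) < size:
        u = rng.choice(active)
        fresh = [v for v in adj[u] if v not in seen]
        if not fresh:
            active.remove(u)  # u cannot spread any further
            continue
        v = rng.choice(fresh)
        seen.add(v)
        active.append(v)
        tree[u].append(v)
        tree[v] = [u]         # the first entry of tree[v] is its parent
    return tree


def euler_sequence(tree, start):
    """Walk the subtree so that every edge is used once in each direction."""
    seq = [start]
    stack = [(start, None, iter(tree[start]))]
    while stack:
        node, parent, neighbours = stack[-1]
        for child in neighbours:
            if child != parent:
                seq.append(child)
                stack.append((child, node, iter(tree[child])))
                break
        else:
            # All neighbours handled: step back to the parent, if any.
            stack.pop()
            if stack:
                seq.append(stack[-1][0])
    return seq


# Hypothetical toy graph given as symmetric adjacency lists.
adj = {0: [1, 2], 1: [0, 2, 3], 2: [0, 1, 4],
       3: [1, 4, 5], 4: [2, 3, 5], 5: [3, 4]}
rng = random.Random(0)
tree = diffusion_tree(adj, start=0, size=4, rng=rng)
seq = euler_sequence(tree, start=0)
print(seq)
```

Under these assumptions, a subtree with n nodes gives a sequence of 2n - 1 vertices that starts and ends at the start node, and every consecutive pair in the sequence is an edge of the original graph. Sequences produced this way from many start nodes would then be passed to the neural network that learns the embedding.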



Benedek Rozemberczki was supported by the Centre for Doctoral Training in Data Science, funded by EPSRC (grant EP/L016427/1).



Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. School of Informatics, University of Edinburgh, Edinburgh, U.K.
