Skip to main content
Book cover

CCF Conference on Big Data

Big Data 2018: Big Data pp 160–177Cite as

Multiple Meta Paths Combined for Vertex Embedding in Heterogeneous Networks

  • Conference paper
  • First Online:
  • 1926 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 945))

Abstract

In the real-world many complex systems exist in the form of heterogeneous networks. As we all know, heterogeneous networks consist of various types of vertices and relations, so it is difficult to deal directly with data mining. At present, although many state-of-the-art methods of network representation learning have been developed, these methods can only deal with homogeneous networks or lose information when handling heterogeneous networks. In order to compensate for the weakness of the previous methods, we propose a multiple meta paths combined embedding (MMPCE) model to represent the heterogeneous networks. This method can automatically obtain the low-dimensional vector representation of vertices and preserve the rich semantic and structural information in the network. We conduct experiments on two real world datasets. The experimental results demonstrate the efficacy and efficiency of the proposed method in heterogeneous network mining tasks. Compare to the previous method, our model can cover a wider range of semantic information and be more flexible and scalable.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Tang, L., Liu, H.: Relational learning via latent social dimensions. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 817–826. ACM (2009)

    Google Scholar 

  2. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

    Google Scholar 

  3. Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: analysis and an algorithm. In: Advances in Neural Information Processing Systems, pp. 849–856 (2002)

    Google Scholar 

  4. Ou, M., Cui, P., Pei, J., et al.: Asymmetric transitivity preserving graph embedding. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1105–1114. ACM (2016)

    Google Scholar 

  5. Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Advances in Neural Information Processing Systems, pp. 1753–1760 (2009)

    Google Scholar 

  6. Roweis, S.T., Saul, L.K.: Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500), 2323–2326 (2000)

    Article  Google Scholar 

  7. Tenenbaum, J.B., De, S.V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Science 290(5500), 2319–2323 (2000)

    Article  Google Scholar 

  8. Yan, S., Xu, D., Zhang, B., et al.: Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Trans. Pattern Anal. Mach. Intell. 29(1), 40–51 (2007)

    Article  Google Scholar 

  9. Mikolov, T., Sutskever, I., Chen, K., et al.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)

    Google Scholar 

  10. Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

  11. Perozzi, B., Al-Rfou, R., Skiena, S.: DeepWalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 701–710. ACM (2014)

    Google Scholar 

  12. Grover, A., Leskovec, J.: node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 855–864. ACM (2016)

    Google Scholar 

  13. Tang, J., Qu, M., Wang, M., et al.: LINE: large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, pp. 1067–1077 (2015)

    Google Scholar 

  14. Tang, J., Qu, M., Mei, Q.: PTE: predictive text embedding through large-scale heterogeneous text networks. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1165–1174. ACM (2015)

    Google Scholar 

  15. Wang, C.J., Wang, T.H., Yang, H.W., et al.: ICE: item concept embedding via textual information. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 85–94. ACM (2017)

    Google Scholar 

  16. Sun, Y., Han, J.: Mining Heterogeneous Information Networks: Principles and Methodologies, pp. 20–28. Morgan & Claypool Publishers, San Rafael (2012)

    Google Scholar 

  17. Sun, Y., Han, J., Yan, X., et al.: PathSim: meta path-based top-K similarity search in heterogeneous information networks. Proc. VLDB Endow. 4(11), 992–1003 (2011)

    Google Scholar 

  18. Li, A.Q., Ahmed, A., Ravi, S., et al.: Reducing the sampling complexity of topic models. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 891–900. ACM (2014)

    Google Scholar 

  19. Tang, J., Zhang, J., Yao, L., et al.: ArnetMiner: extraction and mining of academic social networks. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 990–998. ACM (2008)

    Google Scholar 

  20. Harper, F.M., Konstan, J.A.: The movielens datasets: history and context. ACM Trans. Interact. Intell. Syst. (TiiS) 5(4), 19 (2016)

    Google Scholar 

Download references

Acknowledgment

The authors acknowledge the financial support from the following foundations: National Key R&D Program of China (No. 2017YFC0803700), National Natural Science Foundation of China (No. 61532021 and 61472141) and Shanghai Knowledge Service Platform Project (No. ZF1213).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tong Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wu, T., Sha, C., Wang, X. (2018). Multiple Meta Paths Combined for Vertex Embedding in Heterogeneous Networks. In: Xu, Z., Gao, X., Miao, Q., Zhang, Y., Bu, J. (eds) Big Data. Big Data 2018. Communications in Computer and Information Science, vol 945. Springer, Singapore. https://doi.org/10.1007/978-981-13-2922-7_11

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-2922-7_11

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-2921-0

  • Online ISBN: 978-981-13-2922-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics