Skip to main content

Social Identity Link Across Incomplete Social Information Sources Using Anchor Link Expansion

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9651))

Abstract

Social link identification SIL, that is to identify accounts across different online social networks that belong to the same user, is an important task in social network applications. Most existing methods to solve this problem directly applied machine-learning classifiers on features extracted from user’s rich information. In practice, however, only some limited user information can be obtained because of privacy concerns. In addition, we observe the existing methods cannot handle huge amount of potential account pairs from different OSNs. In this paper, we propose an effective SIL method to address the above two challenges by expanding known anchor links (seed account pairs belonging to the same person). In particular, we leverage potentially useful information possessed by the existing anchor link, and then develop a local expansion model to identify new social links, which are taken as a generated anchor link to be used for iteratively identifying additional new social link. We evaluate our method on two most popular Chinese social networks. Experimental results show our proposed method achieves much better performance in terms of both the number of correct account pairs and efficiency.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Li, X.-L., Foo, C.S., Tew, K.L., Ng, S.-K.: Searching for rising stars in bibliography networks. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds.) DASFAA 2009. LNCS, vol. 5463, pp. 288–292. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  2. Li, X.-L., Tan, A., Yu, P.S., Ng, S.-K.: ECODE: event-based community detection from social networks. In: Yu, J.X., Kim, M.H., Unland, R. (eds.) DASFAA 2011, Part I. LNCS, vol. 6587, pp. 22–37. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  3. Carmagnola, F., Cena, F.: User identification for cross-system personalization. Inf. Sci. 179(1–2), 16–32 (2009)

    Article  Google Scholar 

  4. Vosecky, J., Hong, D., Shen, V.Y.: User identification across multiple social networks. In: Proceedings of NDT 2009 (2009)

    Google Scholar 

  5. Zafarani, R., Liu, H.: Connecting corresponding identities across communities. In: Proceedings of ICWSM 2009 (2009)

    Google Scholar 

  6. Narayanan, A., Shmatikov, V.: De-anonymizing social networks. In: Proceedings of S&P (2009)

    Google Scholar 

  7. Liu, S., Wang, S., Zhu, F., Zhang, J., Krishnan, R.: HYDRA: large-scale social identity linkage via heterogeneous behavior modeling. In: Proceedings of SIGMOD 2014 (2014)

    Google Scholar 

  8. Zafarani, R., Liu, H.: Connecting users across social media sites: a behavioral-modeling approach. In: Proceedings of KDD 2013 (2013)

    Google Scholar 

  9. Jain, P., Kumaraguru, P.: Finding nemo: Searching and Resolving Identities of Users across Online Social Networks (2012). arXiv preprint arXiv:1212.6147

  10. Jain, P., Kumaraguru, P., Joshi, A.: @i seek ‘fb.me’: identifying users across multiple online social networks. In: Proceedings of WWW 2013 (2013)

    Google Scholar 

  11. Malhotra, A., Totti, L., Meira, W., Kumaraguru, P., Almeida, V.: Studying user footprints in different online social networks. In: Proceedings of Advances in Social Networks Analysis and Mining, 2012 (2012)

    Google Scholar 

  12. Nunes, A., Calado, P., Martins, B.: Resolving user identities over social networks through supervised learning and rich similarity features. In: Proceedings of SAC 2012 (2012)

    Google Scholar 

  13. Zhang, H., Kan, M.-Y., Liu, Y., Ma, S.: Online social network profile linkage. In: Jaafar, A., Mohamad Ali, N., Mohd Noah, S.A., Smeaton, A.F., Bruza, P., Bakar, Z.A., Jamil, N., Sembok, T.M.T. (eds.) AIRS 2014. LNCS, vol. 8870, pp. 197–208. Springer, Heidelberg (2014)

    Google Scholar 

  14. Liu, J., Zhang, F., Song, X., Song, Y.I., Lin, C.Y., Hon, H.W.: What’s in a Name?: an unsupervised approach to link users across communities. In: Proceedings of WSDM 2013 (2013)

    Google Scholar 

  15. Goga, O.: Matching User Accounts across Online Social Networks: Methods and Applications. Ph.D. thesis, University Pierre and Marie CURIE (2014)

    Google Scholar 

  16. Iofciu, T., Fankhauser, P., Abel, F., Bischoff, K.: Identifying users across social tagging systems. In: Proceedings of ICWSM 2011 (2011)

    Google Scholar 

  17. Kong, X., Zhang, J., Yu, P.S.: Inferring anchor links across multiple heterogeneous social networks. In: Proceedings of CIKM 2013 (2013)

    Google Scholar 

  18. Anwar, T., Abulaish, M.: An MCL-based text mining approach for namesake disambiguation on the web. In: Proceedings of ICWI 2012 (2012)

    Google Scholar 

  19. Carmagnola, F., Osborne, F., Torre, I.: User data discovery and aggregation: the CS-UDD algorithm. Inf. Sci. 270(20), 41–72 (2014)

    Article  Google Scholar 

  20. Goga, O., Lei, H., Krishnan, S., Friedland, G., Sommer, R., Teixeira, R.: Exploiting innocuous activity for correlating users across sites. In: Proceedings of WWW 2013 (2013)

    Google Scholar 

  21. Zhang, J.W., Yu, P.S., Zhou, Z.H.: Meta-path based multi-network collective link prediction. In: Proceedings of KDD 2014 (2014)

    Google Scholar 

  22. Li, J., Wang, G.A., Chen, H.: Identity matching using personal and social identity features. Inf. Syst. Front. 13(1), 101–113 (2011)

    Article  Google Scholar 

  23. Iofciu, T., Fankhauser, P., Abel, F., Bischoff, K.: Identifying users across social tagging systems. In: Proceedings of AAAI Conference on Weblogs and Social Media, 2011 (2011)

    Google Scholar 

  24. Chen, Y., Zhuang, C., Cao, Q., Hui, P.: Understanding cross-site linking in online social networks. In: Proceedings of SNA-KDD 2014 (2014)

    Google Scholar 

  25. Gusfield, D.: Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge University Press, New York (1999)

    MATH  Google Scholar 

  26. Rapoport, A.: Spread of information through a population with socio-structural bias i: assumption of transitivity. Bull. Math. Biophys. 15(4), 523–533 (1953)

    Article  MathSciNet  Google Scholar 

  27. Zhao, X., Sala, A., Zheng, H., Zhao, B.: Efficient shortest paths on massive social graphs. In: Proceedings of CollaborateCom 2011 (2011)

    Google Scholar 

Download references

Acknowledgments

This work was partially supported by grants from the National Natural Science Foundation of China (Grant No. U1533104, U1333109, 61301245, 61305107).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuxiang Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhang, Y., Wang, L., Li, X., Xiao, C. (2016). Social Identity Link Across Incomplete Social Information Sources Using Anchor Link Expansion. In: Bailey, J., Khan, L., Washio, T., Dobbie, G., Huang, J., Wang, R. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2016. Lecture Notes in Computer Science(), vol 9651. Springer, Cham. https://doi.org/10.1007/978-3-319-31753-3_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-31753-3_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-31752-6

  • Online ISBN: 978-3-319-31753-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics