International Conference on Web-Age Information Management

WAIM 2015: Web-Age Information Management, pp 79-87

Mining Personal Interests of Microbloggers Based on Free Tags in SINA Weibo

  • Xiang Wang
  • Xiang Yu
  • Bin Zhou
  • Yan Jia
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9391)


SINA Weibo, a micro-blogging service, lets users record brief postings about their lives. Users can tag themselves with free tags to show their personal characteristics, but 78.2% of all users do not tag themselves. In this paper, we mine users' personal interests from these self-defined free tags. A directed weighted graph is constructed from the interactive relations between users. We suppose that if two users have interacted with each other, they may share latent common interests, so interests can be propagated from a user to the friends that user has interacted with. Experiments on three SINA Weibo datasets show that our method performs better than existing methods in mining users' personal interests. Moreover, our method is more efficient than these methods, since it uses only the users' self-defined free tags and not the content of their tweets.
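The abstract does not give the propagation rule, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a restart-style propagation in which each user keeps their own tags and receives a damped, weight-proportional share of the tag scores of users who interacted with them. The function name `propagate_tags`, the `damping` and `iterations` parameters, and the edge convention (interest flows from `src` to `dst`) are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def propagate_tags(edges, seed_tags, damping=0.5, iterations=10):
    """Spread tag scores over a directed weighted interaction graph.

    edges:     dict mapping (src, dst) -> interaction weight; interest is
               assumed to flow from src to dst (illustrative convention).
    seed_tags: dict mapping user -> set of self-assigned free tags.
    Returns a dict mapping user -> {tag: score}.
    """
    # Total outgoing weight per user, used to normalise edge weights.
    out_total = defaultdict(float)
    for (src, _dst), weight in edges.items():
        out_total[src] += weight

    # Every user that appears as a tagger or as an edge endpoint.
    users = set(seed_tags)
    for src, dst in edges:
        users.update((src, dst))

    # Seed scores: uniform over a user's own tags, empty if untagged.
    seed = {}
    for user in users:
        tags = seed_tags.get(user, set())
        seed[user] = {tag: 1.0 / len(tags) for tag in tags}

    scores = {user: dict(seed[user]) for user in users}
    for _ in range(iterations):
        # Each round restarts from the seed and adds damped incoming scores.
        updated = {user: dict(seed[user]) for user in users}
        for (src, dst), weight in edges.items():
            share = damping * weight / out_total[src]
            for tag, score in scores[src].items():
                updated[dst][tag] = updated[dst].get(tag, 0.0) + share * score
        scores = updated
    return scores


# Hypothetical toy data: only "alice" has tagged herself.
edges = {("alice", "bob"): 3.0, ("alice", "carol"): 1.0}
seed_tags = {"alice": {"music"}}
result = propagate_tags(edges, seed_tags)
```

In this toy graph the untagged users "bob" and "carol" both inherit the tag "music" from "alice", and "bob" receives the larger score because his interaction weight is higher. Only tags and interaction counts are touched, which matches the abstract's point that tweet content is not needed.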


Keywords: Interest · Microblog · Tag · SINA Weibo



The research was sponsored by the National 973 Program (Grant Nos. 2013CB329604, 2013CB329601, 2013CB329602), NSFC (Grant Nos. 60933005, 91124002, 61202362), the 863 Program (Grant Nos. 2012AA01A401, 2012AA01A402), and the National Key Technology R&D Program (Grant Nos. 2012BAH38B04, 2012BAH38B06).



Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. School of Computer, National University of Defense Technology, Changsha, China
