Machine Learning

, Volume 107, Issue 3, pp 551–578 | Cite as

Identifying and tracking topic-level influencers in the microblog streams

  • Sen Su
  • Yakun Wang
  • Zhongbao Zhang
  • Cheng Chang
  • Muhammad Azam Zia
Article
  • 163 Downloads

Abstract

Topic-level social influence analysis has been playing an important role in the online social networks like microblogs. Previous works usually use the cumulative number of links, such as the number of followers, to measure users’ topic-level influence in a static network. However, they ignore the dynamics of influence and the methods they proposed can not be applied to social streams. To address the limitations of prior works, we firstly propose a novel topic-level influence over time (TIT) model integrating the text, links and time to analyze the topic-level temporal influence of each user. We then design an influence decay based approach to measure users’ topic-level influence from the learned temporal influence. In order to track the influencers in data streams, we combine TIT and the influence decay method into a united online model (named oTIT), which is applicable to dynamic scenario. Through extensive experiments, we demonstrate the superiority of our approach, compared with the baseline and the state-of-the-art method. Moreover, we discover influence exhibits significantly different variation patterns over different topics, which verifies our viewpoint and gives us a new angle to understand its dynamic nature.

Keywords

Social influence Graphical model Online Sina Weibo 

Notes

Acknowledgements

This work is supported in part by the following funding agencies of China: National Natural Science Foundation under Grants 61170274 and U1534201, and the Fundamental Research Funds for the Central Universities (2015RC21).

References

  1. AlSumait, L., Barbará, D., & Domeniconi, C. (2008). On-line LDA: Adaptive topic models for mining text streams with applications to topic detection and tracking. In Eighth IEEE international conference on data mining, 2008. ICDM’08 (pp. 3–12). IEEE.Google Scholar
  2. Bakshy, E., Hofman, J. M., Mason, W. A., & Watts, D. J. (2011). Everyone’s an influencer: Quantifying influence on twitter. In Proceedings of the fourth ACM international conference on Web search and data mining (pp. 65–74). ACM.Google Scholar
  3. Bi, B., Kao, C., Wan, C., & Cho, J. (2014a). Who are experts specializing in landscape photography? In 20th ACM SIGKDD conference on knowledge discovery and data mining.Google Scholar
  4. Bi, B., Tian, Y., Sismanis, Y., Balmin, A., Cho, J. (2014b). Scalable topic-specific influence analysis on microblogs. In Proceedings of the 7th ACM international conference on Web search and data mining (pp. 513–522). ACM.Google Scholar
  5. Blei, D. M., & Lafferty, J. D. (2006). Dynamic topic models. In Proceedings of the 23rd international conference on machine learning (pp. 113–120). ACM.Google Scholar
  6. Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.MATHGoogle Scholar
  7. Cha, M., Haddadi, H., Benevenuto, F., & Gummadi, P. K. (2010). Measuring user influence in twitter: The million follower fallacy. ICWSM, 10(10–17), 30.Google Scholar
  8. Chang, J., & Blei, D. M. (2009). Relational topic models for document networks. AIStats, 9, 81–88.Google Scholar
  9. Crandall, D., Cosley, D., Huttenlocher, D., Kleinberg, J., & Suri, S. (2008). Feedback effects between similarity and social influence in online communities. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 160–168). ACM.Google Scholar
  10. Diao, Q., Jiang, J., Zhu, F., & Lim, E. P. (2012). Finding bursty topics from microblogs. In Proceedings of the 50th annual meeting of the association for computational linguistics: Long papers (Vol. 1, pp. 536–544). Association for Computational Linguistics.Google Scholar
  11. Dietz, L., Bickel, S., & Scheffer, T. (2007). Unsupervised prediction of citation influences. In Proceedings of the 24th international conference on Machine learning (pp. 233–240). ACM.Google Scholar
  12. Embar, V. R., Bhattacharya, I., Pandit, V., & Vaculin, R. (2015). Online topic-based social influence analysis for the wimbledon championships. In Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining (pp. 1759–1768). ACM.Google Scholar
  13. Erosheva, E., Fienberg, S., & Lafferty, J. (2004). Mixed-membership models of scientific publications. Proceedings of the National Academy of Sciences, 101(suppl 1), 5220–5227.CrossRefGoogle Scholar
  14. Foulds, J. R., & Smyth, P. (2013). Modeling scientific impact with topical influence regression. In EMNLP (pp. 113–123).Google Scholar
  15. Gerrish, S., & Blei, D. M. (2010). A language-based approach to measuring scholarly impact. ICML, 10, 375–382.Google Scholar
  16. Goyal, A., Bonchi, F., & Lakshmanan, L. V.(2010). Learning influence probabilities in social networks. In Proceedings of the third ACM international conference on Web search and data mining (pp. 241–250). ACM.Google Scholar
  17. Griffiths, T. L., & Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Sciences, 101(suppl 1), 5228–5235.CrossRefGoogle Scholar
  18. Guille, A., Hacid, H., Favre, C., & Zighed, D. A. (2013). Information diffusion in online social networks: A survey. ACM SIGMOD Record, 42(2), 17–28.CrossRefGoogle Scholar
  19. He, X., Rekatsinas, T., Foulds, J., Getoor, L., & Liu, Y. (2015). Hawkestopic: A joint model for network inference and topic modeling from text-based cascades. In Proceedings of the 32nd international conference on machine learning.Google Scholar
  20. Kempe, D., Kleinberg, J., Tardos, É. (2003). Maximizing the spread of influence through a social network. In Proceedings of the ninth ACM SIGKDD international conference on knowledge discovery and data mining (pp. 137–146). ACM.Google Scholar
  21. Lampos, V., Aletras, N., & Preoţiuc-Pietro, D., & Cohn, T. (2014). Predicting and characterising user impact on twitter. In 14th conference of the European chapter of the Association for Computational Linguistics 2014, EACL 2014 (pp. 405–413).Google Scholar
  22. Lin, S., Wang, F., Hu, Q., & Yu, P. S. (2013). Extracting social events for learning better information diffusion models. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 365–373). ACM.Google Scholar
  23. Liu, L., Tang, J., Han, J., Jiang, M., & Yang, S. (2010). Mining topic-level influence in heterogeneous networks. In Proceedings of the 19th ACM international conference on information and knowledge management (pp. 199–208). ACM.Google Scholar
  24. Mehmood, Y., Barbieri, N., Bonchi, F., & Ukkonen, A. (2013). CSI: Community-level social influence analysis. In H. Blockeel, K. Kersting, S. Nijssen, & F. Železný (Eds.), Machine learning and knowledge discovery in databases (pp. 48–63). Springer.Google Scholar
  25. Nallapati, R., Mcfarland, D. A., & Manning, C. D. (2011). Topicflow model: Unsupervised learning of topic-specific influences of hyperlinked documents. In AISTATS (pp. 543–551).Google Scholar
  26. Pal, A., & Counts, S. (2011). Identifying topical authorities in microblogs. In Proceedings of the fourth ACM international conference on Web search and data mining (pp. 45–54). ACM.Google Scholar
  27. Romero, D. M., Galuba, W., Asur, S., & Huberman, B. A. (2011). Influence and passivity in social media. In D. Gunopulos, T. Hofmann, D. Malerba, & M. Vazirgiannis (Eds.), Machine learning and knowledge discovery in databases (pp. 18–33). Springer.Google Scholar
  28. Saez-Trumper, D., Comarela, G., Almeida, V., Baeza-Yates, R., & Benevenuto, F. (2012). Finding trendsetters in information networks. In Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining (pp. 1014–1022). ACM.Google Scholar
  29. Subbian, K., Aggarwal, C. C., & Srivastava, J. (2016). Querying and tracking influencers in social streams. In Proceedings of the ninth ACM international conference on Web search and data mining (pp. 493–502). ACM.Google Scholar
  30. Tang, J., Sun, J., Wang, C., & Yang, Z. (2009). Social influence analysis in large-scale networks. In Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining (pp. 807–816). ACM.Google Scholar
  31. Tang, J., Zhang, J., Jin, R., Yang, Z., Cai, K., Zhang, L., et al. (2011). Topic level expertise search over heterogeneous networks. Machine Learning, 82(2), 211–237.MathSciNetCrossRefGoogle Scholar
  32. Wang, X., & McCallum, A. (2006). Topics over time: A non-Markov continuous-time model of topical trends. In Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining (pp. 424–433). ACM.Google Scholar
  33. Wang, Z., Wang, S., & Huang, Q. (2015). Latent influence propagation on dynamic networks. In 2015 IEEE China summit and international conference on signal and information processing (ChinaSIP) (pp. 777–781). IEEE.Google Scholar
  34. Wang, Y., Zhang, Z., Su, S., Chang, C., & Zia, M. A. (2016). Topic-level influencers identification in the microblog sphere. In ECAI (pp. 1559–1560).Google Scholar
  35. Weng, J., Lim, E. P., Jiang, J., & He, Q. (2010). Twitterrank: Finding topic-sensitive influential twitterers. In Proceedings of the third ACM international conference on Web search and data mining (pp. 261–270). ACM.Google Scholar
  36. Zhang, J., Liu, B., Tang, J., Chen, T., & Li, J. (2013). Social influence locality for modeling retweeting behaviors. In Proceedings of the twenty-third international joint conference on artificial intelligence (pp. 2761–2767). AAAI Press.Google Scholar
  37. Zhang, J., Wang, C., Wang, J., & Yu, J. X. (2014). Inferring continuous dynamic social influence and personal preference for temporal behavior prediction. Proceedings of the VLDB Endowment, 8(3), 269–280.CrossRefGoogle Scholar

Copyright information

© The Author(s) 2017

Authors and Affiliations

  • Sen Su
    • 1
  • Yakun Wang
    • 1
  • Zhongbao Zhang
    • 1
  • Cheng Chang
    • 1
  • Muhammad Azam Zia
    • 1
  1. 1.State Key Laboratory of Networking and Switching TechnologyBeijing University of Posts and TelecommunicationsBeijingChina

Personalised recommendations