Predicting Information Diffusion in Online Social Platforms: A Twitter Case Study

  • Kateryna Lytvyniuk
  • Rajesh SharmaEmail author
  • Anna Jurek-Loughrey
Conference paper
Part of the Studies in Computational Intelligence book series (SCI, volume 812)


Online social media has become a part of everyday life of modern society. A lot of information is created on these platforms and shared with the community continuously. Predicting information diffusion on online social platforms has been studied in the past by many researchers as it has its applications in various domains such as viral marketing, news propagation etc. Some information spreads faster compared to others depending on topic of interest of the online users. In this work, we investigate the information diffusion problem using Twitter data as a use case study. We define tweet popularity as number of retweets any original message receives. In total we extracted 27 features which can be categorised into content, user, sentiment and initial retweeting behaviour for creating our prediction model. We study the problem of predicting as a multiclass prediction task. Three datasets from Twitter about three different topics are collected and analysed for building and testing various models based on different machine learning algorithms. The models were able to predict up to 60% of overall accuracy and an F1 score of 67% is obtained. The models are created using one of the dataset and tested on all the datasets, which shows that the model is robust enough to handle different topics.


Online social networks Information diffusion Machine learning Data analytics 



This work is supported by H2020 framework project, SoBigData, grant number 654024.


  1. 1.
    Hong, L., Dan, O., Davison, B.D.: Predicting popular messages in twitter. In: Proceedings of the 20th International Conference Companion on World Wide Web, vol. 46, no. 3, pp. 57–58 (2011)Google Scholar
  2. 2.
    Naveed, N., Gottron, T., Kunegis, J., Alhadi, A.C.: Bad news travel fast: a content-based analysis of interestingness on twitter. In: Proceedings of the 3rd International Web Science Conference, pp. 8:1–8:7 (2011)Google Scholar
  3. 3.
    Yang, J., Counts, S.: Predicting the speed, scale, and range of information diffusion in twitter. In: Proceedings of the Fourth International Conference on Weblogs and Social Media, vol. 2010, no. 10(2010)Google Scholar
  4. 4.
    Kafeza, E., Kanavos, A., Makris, C., Vikatos, P.: Predicting information diffusion patterns in twitter. In: 10th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), vol. 2014, no. 10 (2014)Google Scholar
  5. 5.
    Taxidou, I., Fischer, P.M.: Online analysis of information diffusion in twitter. In: Proceedings of the 23rd International Conference on World Wide Web, vol. 2014 (2014)Google Scholar
  6. 6.
    Kupavskii, A., Ostroumova, L., Umnov, A., Usachev, S., Serdyukov, P., Gusev, G., Kustarev, A.: Prediction of retweet cascade size over time. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, vol. 2012 (2012)Google Scholar
  7. 7.
    Shafiq, Z., Liu, A.: Cascade size prediction in online social networks. In: IFIP Networking Conference (IFIP Networking) and Workshops, vol. 2017 (2017)Google Scholar
  8. 8.
    Chen, J., Li, H., Wu, Z., Hossain, M.S.: Sentiment analysis of the correlation between regular tweets and retweets. In: IEEE 16th International Symposium on Network Computing and Applications (NCA), pp. 1–5 (2017)Google Scholar
  9. 9.
    Wu, B., Shen, H.: Analyzing and predicting news popularity on twitter. Int. J. Inf. Manag. 35(6), 702–711 (2015)Google Scholar
  10. 10.
    Okubo, K., Oida, K.: A successful advertising strategy over twitter. Comput. Inf. Sci. 10, 10–22 (2017)Google Scholar
  11. 11.
    Mazloom, M., Rietveld, R., Rudinac, S., Worring, M., van Dolen, W.: Multimodal popularity prediction of brand-related social media posts. In: Proceedings of the 2016 ACM on Multimedia Conference, pp. 197–201 (2016)Google Scholar
  12. 12.
    Sarabchi, F.: Quantitative Prediction of Twitter Message Dissemination: A Machine Learning Approach. Technical University of Delft (2015)Google Scholar
  13. 13.
    Cazzoli, L., Sharma, R., Treccani, M., Lillo, M.: A large scale study to understand the relation between twitter and financial market. In: Third European Network Intelligence Conference (ENIC), pp. 98–105 (2016)Google Scholar
  14. 14.
    Adamic, L.A., Glance, N.: The political blogosphere and the 2004 U.S. election: divided they Blog. In: Proceedings of the 3rd International Workshop on Link Discovery, pp. 36–43 (2005)Google Scholar
  15. 15.
    Cohen, K., Johansson, F., Kaati, L., Mork, J.C.: Detecting linguistic markers for radical violence in social media. Terror. Polit. Violence 26(1), 246–256 (2014)Google Scholar
  16. 16.
    Bakshy, E., Hofman, J.M., Mason, W.A., Watts, D.J.: Everyone’s an influencer: quantifying influence on twitter. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 65–74 (2011)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Kateryna Lytvyniuk
    • 1
  • Rajesh Sharma
    • 1
    Email author
  • Anna Jurek-Loughrey
    • 2
  1. 1.University of TartuTartuEstonia
  2. 2.Queen’s University BelfastBelfastUK

Personalised recommendations