Skip to main content

Predicting Retweet Behavior in Online Social Networks Based on Locally Available Information

  • Conference paper
  • First Online:
Social Informatics (SocInfo 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10047))

Included in the following conference series:

  • 2586 Accesses

Abstract

Behavior prediction in online social networks (OSNs) has attracted lots of attention due to its vast applications. However, most previous work needs global network information to train classifiers. Due to the large data volume and privacy concern, it is infeasible to obtain global network information for every OSN. We propose a decentralized framework, named REPULSE, to predict whether a target user will retweet a message relayed by his friends. We also identify a new set of community-related features that improve retweet prediction accuracy considerably.

To demonstrate the value of community-related features, we propose another framework named HOTPIE to predict tweets popularity. Utilizing community-related features can boost the F1 score of popularity prediction from 0.43 to 0.55. To the best of our knowledge, this is the first work which systematically studies the impact of global vs. locally observable information on the prediction of retweet behavior in OSNs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    In Twitter, only a complete set of users who have retweeted the same message is shown, without disclosing the actual ordering. This set of forwarding users in Twitter aggregates information from different retweet-paths in the overall diffusion graph. Note that the set of forwarding users can serve the same purpose as retweet-paths do.

References

  1. Artzi, Y., Pantel, P., Gamon, M.: Association for computational linguistics: human language technologies. In: Predicting responses to microblog posts. In: Proceedings of the 2012 Conference of the North American, pp. 602–606. Association for Computational Linguistics (2012)

    Google Scholar 

  2. Bandari, R., Asur, S., Huberman, B.A.: The pulse of news in social media: Forecasting popularity. arXiv preprint (2012). arXiv:1202.0332

  3. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  4. Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)

    Article  Google Scholar 

  5. Boyd, D., Golder, S., Lotan, G.: Tweet, tweet, retweet: conversational aspects of retweeting on twitter. In: 2010 43rd Hawaii International Conference on System Sciences (HICSS), pp. 1–10. IEEE (2010)

    Google Scholar 

  6. Chen, K., Chen, T., Zheng, G., Jin, O., Yao, E., Yu, Y.: Collaborative personalized tweet recommendation. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 661–670. ACM (2012)

    Google Scholar 

  7. Cheng, J., Adamic, L., Dow, P.A., Kleinberg, J.M., Leskovec, J.: Can cascades be predicted? In: Proceedings of the 23rd International Conference on World Wide Web, pp. 925–936. ACM (2014)

    Google Scholar 

  8. Chesney, T.: Networked individuals predict a community wide outcome from their local information. Decis. Support Syst. 57, 11–21 (2014)

    Article  Google Scholar 

  9. Earle, P.S., Bowden, D.C., Guy, M.: Twitter earthquake detection: earthquake monitoring in a social world. Annal. Geophys. 54(6), 708–715 (2012)

    Google Scholar 

  10. Guille, A., Hacid, H., Favre, C., Zighed, D.A.: Information diffusion in online social networks: a survey. ACM SIGMOD Rec. 42(2), 17–28 (2013)

    Article  Google Scholar 

  11. Hong, L., Dan, O., Davison, B.D.: Predicting popular messages in twitter. In: Proceedings of the 20th International Conference Companion on World Wide Web, pp. 57–58. ACM (2011)

    Google Scholar 

  12. Kupavskii, A., Ostroumova, L., Umnov, A., Usachev, S., Serdyukov, P., Gusev, G., Kustarev, A.: Prediction of retweet cascade size over time. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 2335–2338. ACM (2012)

    Google Scholar 

  13. Lee, K., Mahmud, J., Chen, J., Zhou, M., Nichols, J.: Who will retweet this? Automatically identifying and engaging strangers on twitter to spread information. In: Proceedings of the 19th International Conference on Intelligent User Interfaces, pp. 247–256. ACM (2014)

    Google Scholar 

  14. Luo, Z., Osborne, M., Tang, J., Wang, T.: Who will retweet me? finding retweeters in twitter. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 869–872. ACM (2013)

    Google Scholar 

  15. Ma, Z., Sun, A., Cong, G.: On predicting the popularity of newly emerging hashtags in twitter. J. Am. Soc. Inform. Sci. Technol. 64(7), 1399–1410 (2013)

    Article  Google Scholar 

  16. Naveed, N., Gottron, T., Kunegis, J., Alhadi, A.C.: Bad news travel fast: A content-based analysis of interestingness on twitter. In: Proceedings of the 3rd International Web Science Conference, p. 8. ACM (2011)

    Google Scholar 

  17. Petrovic, S., Osborne, M., Lavrenko, V.: Rt to win! predicting message propagation in twitter. In: ICWSM (2011)

    Google Scholar 

  18. Suh, B., Hong, L., Pirolli, P., Chi, E.H.: Want to be retweeted? large scale analytics on factors impacting retweet in twitter network. In: 2010 IEEE Second International Conference on Social Computing (socialcom), pp. 177–184. IEEE (2010)

    Google Scholar 

  19. Tang, L., Ni, Z., Xiong, H., Zhu, H.: Locating targets through mention in twitter. World Wide Web 18(4), 1019–1049 (2015)

    Article  Google Scholar 

  20. Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining multi-label data. In: Data Mining and Knowledge Discovery Handbook, pp. 667–685. Springer (2009)

    Google Scholar 

  21. Tsur, O., Rappoport, A.: What’s in a hashtag?: content based prediction of the spread of ideas in microblogging communities. In: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, pp. 643–652. ACM (2012)

    Google Scholar 

  22. Tumasjan, A., Sprenger, T.O., Sandner, P.G., Welpe, I.M.: Predicting elections with twitter: What 140 characters reveal about political sentiment. In: ICWSM 2010, pp. 178–185 (2010)

    Google Scholar 

  23. Uysal, I., Croft, W.B.: User oriented tweet ranking: a filtering approach to microblogs. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 2261–2264. ACM (2011)

    Google Scholar 

  24. Yan, X., Guo, J., Lan, Y., Cheng, X.: A biterm topic model for short texts. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1445–1456. International World Wide Web Conferences Steering Committee (2013)

    Google Scholar 

  25. Yang, J., Counts, S.: Predicting the speed, scale, and range of information diffusion in twitter. In: ICWSM 2010, pp. 355–358 (2010)

    Google Scholar 

  26. Zaman, T., Fox, E.B., Bradlow, E.T., et al.: A bayesian approach for predicting the popularity of tweets. Annal. Appl. Stat. 8(3), 1583–1611 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  27. Zaman, T.R., Herbrich, R., Van Gael, J., Stern, D.: Predicting information spreading in twitter. In: Workshop on Computational Social Science and the Wisdom of Crowds, Nips, vol. 104, pp. 17599–601. Citeseer (2010)

    Google Scholar 

  28. Zhang, X., Fuehres, H., Gloor, P.A.: Predicting stock market indicators through twitter i hope it is not as bad as i fear. Procedia-Soc. Behav. Sci. 26, 55–62 (2011)

    Article  Google Scholar 

  29. Zhao, Q., Erdogdu, M.A., He, H.Y., Rajaraman, A., Leskovec, J.: Seismic: A self-exciting point process model for predicting tweet popularity. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1513–1522. ACM (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guanchen Li .

Editor information

Editors and Affiliations

Appendices

A Workflow of HOTPIE

Fig. 6.
figure 6

Workflow of HOTPIE

B Confusion Matrices of HOTPIE and PPuG

Table 4. Confusion matrix of using HOTPIE, with per class accuracy
Table 5. Confusion matrix of using PPuG, with per class accuracy
Table 6. Confusion matrix without community-related features, with per class accuracy

C Full Feature List

Table 7. Feature names with feature IDs

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Li, G., Lau, W.C. (2016). Predicting Retweet Behavior in Online Social Networks Based on Locally Available Information. In: Spiro, E., Ahn, YY. (eds) Social Informatics. SocInfo 2016. Lecture Notes in Computer Science(), vol 10047. Springer, Cham. https://doi.org/10.1007/978-3-319-47874-6_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-47874-6_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-47873-9

  • Online ISBN: 978-3-319-47874-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics