Predicting Retweet Behavior in Online Social Networks Based on Locally Available Information

Li, Guanchen; Lau, Wing Cheong

doi:10.1007/978-3-319-47874-6_8

Guanchen Li¹⁵ &
Wing Cheong Lau¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10047))

Included in the following conference series:

International Conference on Social Informatics

2586 Accesses

Abstract

Behavior prediction in online social networks (OSNs) has attracted lots of attention due to its vast applications. However, most previous work needs global network information to train classifiers. Due to the large data volume and privacy concern, it is infeasible to obtain global network information for every OSN. We propose a decentralized framework, named REPULSE, to predict whether a target user will retweet a message relayed by his friends. We also identify a new set of community-related features that improve retweet prediction accuracy considerably.

To demonstrate the value of community-related features, we propose another framework named HOTPIE to predict tweets popularity. Utilizing community-related features can boost the F1 score of popularity prediction from 0.43 to 0.55. To the best of our knowledge, this is the first work which systematically studies the impact of global vs. locally observable information on the prediction of retweet behavior in OSNs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
In Twitter, only a complete set of users who have retweeted the same message is shown, without disclosing the actual ordering. This set of forwarding users in Twitter aggregates information from different retweet-paths in the overall diffusion graph. Note that the set of forwarding users can serve the same purpose as retweet-paths do.

References

Artzi, Y., Pantel, P., Gamon, M.: Association for computational linguistics: human language technologies. In: Predicting responses to microblog posts. In: Proceedings of the 2012 Conference of the North American, pp. 602–606. Association for Computational Linguistics (2012)
Google Scholar
Bandari, R., Asur, S., Huberman, B.A.: The pulse of news in social media: Forecasting popularity. arXiv preprint (2012). arXiv:1202.0332
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
MATH Google Scholar
Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)
Article Google Scholar
Boyd, D., Golder, S., Lotan, G.: Tweet, tweet, retweet: conversational aspects of retweeting on twitter. In: 2010 43rd Hawaii International Conference on System Sciences (HICSS), pp. 1–10. IEEE (2010)
Google Scholar
Chen, K., Chen, T., Zheng, G., Jin, O., Yao, E., Yu, Y.: Collaborative personalized tweet recommendation. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 661–670. ACM (2012)
Google Scholar
Cheng, J., Adamic, L., Dow, P.A., Kleinberg, J.M., Leskovec, J.: Can cascades be predicted? In: Proceedings of the 23rd International Conference on World Wide Web, pp. 925–936. ACM (2014)
Google Scholar
Chesney, T.: Networked individuals predict a community wide outcome from their local information. Decis. Support Syst. 57, 11–21 (2014)
Article Google Scholar
Earle, P.S., Bowden, D.C., Guy, M.: Twitter earthquake detection: earthquake monitoring in a social world. Annal. Geophys. 54(6), 708–715 (2012)
Google Scholar
Guille, A., Hacid, H., Favre, C., Zighed, D.A.: Information diffusion in online social networks: a survey. ACM SIGMOD Rec. 42(2), 17–28 (2013)
Article Google Scholar
Hong, L., Dan, O., Davison, B.D.: Predicting popular messages in twitter. In: Proceedings of the 20th International Conference Companion on World Wide Web, pp. 57–58. ACM (2011)
Google Scholar
Kupavskii, A., Ostroumova, L., Umnov, A., Usachev, S., Serdyukov, P., Gusev, G., Kustarev, A.: Prediction of retweet cascade size over time. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 2335–2338. ACM (2012)
Google Scholar
Lee, K., Mahmud, J., Chen, J., Zhou, M., Nichols, J.: Who will retweet this? Automatically identifying and engaging strangers on twitter to spread information. In: Proceedings of the 19th International Conference on Intelligent User Interfaces, pp. 247–256. ACM (2014)
Google Scholar
Luo, Z., Osborne, M., Tang, J., Wang, T.: Who will retweet me? finding retweeters in twitter. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 869–872. ACM (2013)
Google Scholar
Ma, Z., Sun, A., Cong, G.: On predicting the popularity of newly emerging hashtags in twitter. J. Am. Soc. Inform. Sci. Technol. 64(7), 1399–1410 (2013)
Article Google Scholar
Naveed, N., Gottron, T., Kunegis, J., Alhadi, A.C.: Bad news travel fast: A content-based analysis of interestingness on twitter. In: Proceedings of the 3rd International Web Science Conference, p. 8. ACM (2011)
Google Scholar
Petrovic, S., Osborne, M., Lavrenko, V.: Rt to win! predicting message propagation in twitter. In: ICWSM (2011)
Google Scholar
Suh, B., Hong, L., Pirolli, P., Chi, E.H.: Want to be retweeted? large scale analytics on factors impacting retweet in twitter network. In: 2010 IEEE Second International Conference on Social Computing (socialcom), pp. 177–184. IEEE (2010)
Google Scholar
Tang, L., Ni, Z., Xiong, H., Zhu, H.: Locating targets through mention in twitter. World Wide Web 18(4), 1019–1049 (2015)
Article Google Scholar
Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining multi-label data. In: Data Mining and Knowledge Discovery Handbook, pp. 667–685. Springer (2009)
Google Scholar
Tsur, O., Rappoport, A.: What’s in a hashtag?: content based prediction of the spread of ideas in microblogging communities. In: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, pp. 643–652. ACM (2012)
Google Scholar
Tumasjan, A., Sprenger, T.O., Sandner, P.G., Welpe, I.M.: Predicting elections with twitter: What 140 characters reveal about political sentiment. In: ICWSM 2010, pp. 178–185 (2010)
Google Scholar
Uysal, I., Croft, W.B.: User oriented tweet ranking: a filtering approach to microblogs. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 2261–2264. ACM (2011)
Google Scholar
Yan, X., Guo, J., Lan, Y., Cheng, X.: A biterm topic model for short texts. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1445–1456. International World Wide Web Conferences Steering Committee (2013)
Google Scholar
Yang, J., Counts, S.: Predicting the speed, scale, and range of information diffusion in twitter. In: ICWSM 2010, pp. 355–358 (2010)
Google Scholar
Zaman, T., Fox, E.B., Bradlow, E.T., et al.: A bayesian approach for predicting the popularity of tweets. Annal. Appl. Stat. 8(3), 1583–1611 (2014)
Article MathSciNet MATH Google Scholar
Zaman, T.R., Herbrich, R., Van Gael, J., Stern, D.: Predicting information spreading in twitter. In: Workshop on Computational Social Science and the Wisdom of Crowds, Nips, vol. 104, pp. 17599–601. Citeseer (2010)
Google Scholar
Zhang, X., Fuehres, H., Gloor, P.A.: Predicting stock market indicators through twitter i hope it is not as bad as i fear. Procedia-Soc. Behav. Sci. 26, 55–62 (2011)
Article Google Scholar
Zhao, Q., Erdogdu, M.A., He, H.Y., Rajaraman, A., Leskovec, J.: Seismic: A self-exciting point process model for predicting tweet popularity. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1513–1522. ACM (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

The Chinese University of Hong Kong, Shatin, Hong Kong
Guanchen Li & Wing Cheong Lau

Authors

Guanchen Li
View author publications
You can also search for this author in PubMed Google Scholar
Wing Cheong Lau
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Guanchen Li .

Editor information

Editors and Affiliations

University of Washington, Seattle, Washington, USA
Emma Spiro
Indiana University, Bloomington, Indiana, USA
Yong-Yeol Ahn

Appendices

A Workflow of HOTPIE

B Confusion Matrices of HOTPIE and PPuG

Table 4. Confusion matrix of using HOTPIE, with per class accuracy

Full size table

Table 5. Confusion matrix of using PPuG, with per class accuracy

Full size table

Table 6. Confusion matrix without community-related features, with per class accuracy

Full size table

C Full Feature List

Table 7. Feature names with feature IDs

Full size table

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, G., Lau, W.C. (2016). Predicting Retweet Behavior in Online Social Networks Based on Locally Available Information. In: Spiro, E., Ahn, YY. (eds) Social Informatics. SocInfo 2016. Lecture Notes in Computer Science(), vol 10047. Springer, Cham. https://doi.org/10.1007/978-3-319-47874-6_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-47874-6_8
Published: 19 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47873-9
Online ISBN: 978-3-319-47874-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Predicting Retweet Behavior in Online Social Networks Based on Locally Available Information

Abstract

Access this chapter

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Appendices

A Workflow of HOTPIE

B Confusion Matrices of HOTPIE and PPuG

C Full Feature List

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation