Abstract
Social media is an important data source to compliment traditional epidemic surveillance. However, misinformation in social media hinders the exploitation of valuable information. Analysis of information credibility has drawn much attention of academia in recent years. In this paper, we focus on analyzing the credibility of influenza posts published on Sina Weibo. We propose a semi-supervised probabilistic graphical model to jointly learn the interactions between user trustworthiness, content reliability, and post credibility. To test the performance of the approach, we apply it to identify credible influenza posts published from May 2013 to June 2014 on Sina Weibo. Random Forests and the Bayesian Network are used as baselines for evaluation. The results show that our approach performs effectively with the highest average accuracy of 71.7 %, f-measure 51 %. Our proposed framework significantly outperformed the baselines in detecting credible influenza posts on Sina Weibo.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Al-Eidan, R., Al-Khalifa H., Al-Salman A.: Measuring the credibility of arabic text content in Twitter. In: 2010 Fifth International Conference on Digital Information Management (ICDIM), pp. 285–291. IEEE (2010)
Yang, C.C., Yang, H., Jiang, L., Zhang, M.: Social media mining for drug safety signal detection. In: Proceedings of the 2012 International Workshop on Smart Health and Wellbeing, pp. 33–40. ACM (2012)
Yang, H., Yang, C.C.: Harnessing social media for drug-drug interactions detection. In: 2013 IEEE International Conference on Healthcare Informatics (ICHI), pp. 22–29. IEEE (2013)
Gupta, A., Kumaraguru, P.: Credibility ranking of tweets during high impact events. In: Proceedings of the 1st Workshop on Privacy and Security in Online Social Media, p. 2. ACM (2012)
Yang, J., Counts, S., Morris, M.R., Hoff, A.: Microblog credibility perceptions: comparing the USA and China. In: Proceedings of the 2013 Conference on Computer Supported Cooperative Work, pp. 575–586. ACM (2013)
AlMansour, A.A., Brankovic, L., Iliopoulos, C.S.: A model for recalibrating credibility in different contexts and languages-a Twitter case study. Int. J. Digital Inf. Wirel. Commun. (IJDIWC) 4(1), 53–62 (2014)
Walter, Z.: Web credibility and stickiness of content web sites. In: International Conference on Wireless Communications, Networking and Mobile Computing, pp. 3820–3823. IEEE (2007)
Juffinger, A., Granitzer, M., Lex, E.: Blog credibility ranking by exploiting verified content. In: Proceedings of the 3rd Workshop on Information Credibility on the Web, pp. 51–58. ACM (2009)
Vydiswaran, V., Zhai, C., Roth, D.: Content-driven trust propagation framework. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 974–982. ACM (2011)
Wanas, N., El-Saban, M., Ashour, H., Ammar, W.: Automatic scoring of online discussion posts. In: Proceedings of the 2nd ACM Workshop on information Credibility on the Web, pp. 19–26. ACM (2008)
Benevenuto, F., Magno, G., Rodrigues, T., Almeida, V.: Detecting spammers on Twitter. In: Collaboration, Electronic Messaging, Anti-abuse and Spam Conference (CEAS), vol. 6, p. 12 (2010)
Castillo, C., Mendoza, M., Poblete, B.: Information credibility on Twitter. In: Proceedings of the 20th International Conference on World Wide Web, pp. 675–684. ACM (2011)
Qazvinian, V., Rosengren, E., Radev, D.R., Mei, Q.: Rumor has it: identifying misinformation in microblogs. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1589–1599. Association for Computational Linguistics (2011)
Gupta, M., Zhao, P., Han, J.: Evaluating event credibility on Twitter. In: SDM, pp. 153–164. SIAM (2012)
Pasternack, J., Roth, D.: Latent credibility analysis. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1009–1020. International World Wide Web Conferences Steering Committee (2013)
Sondhi, P., Vydiswaran, V., Zhai, C.: Reliability prediction of webpages in the medical domain. In: Baeza-Yates, R., de Vries, A.P., Zaragoza, H., Cambazoglu, B., Murdock, V., Lempel, R., Silvestri, F. (eds.) ECIR 2012. LNCS, vol. 7224, pp. 219–231. Springer, Heidelberg (2012)
Mukherjee, S., Weikum, G., Danescu-Niculescu-Mizil, C.: People on drugs: credibility of user statements in health communities. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 65–74. ACM (2014)
Lafferty, J., McCallum, A., Pereira, F.C.: Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data (2001)
Acknowledgements
We thank Jingwei Li, Qihui Xia, and Lidan Chen for the help with preprocessing and labeling data. We also show our great appreciation to professor Hsinchun Chen for the help with revising this paper. We finally would like to thank all the reviewers for their modification suggestions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Guo, Q., Huang, W.(., Huang, K., Liu, X. (2016). Information Credibility: A Probabilistic Graphical Model for Identifying Credible Influenza Posts on Social Media. In: Zheng, X., Zeng, D., Chen, H., Leischow, S. (eds) Smart Health. ICSH 2015. Lecture Notes in Computer Science(), vol 9545. Springer, Cham. https://doi.org/10.1007/978-3-319-29175-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-29175-8_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-29174-1
Online ISBN: 978-3-319-29175-8
eBook Packages: Computer ScienceComputer Science (R0)