Information Credibility: A Probabilistic Graphical Model for Identifying Credible Influenza Posts on Social Media

Guo, Qiaozhen; Huang, Wei (Wayne); Huang, Kai; Liu, Xiao

doi:10.1007/978-3-319-29175-8_12

Qiaozhen Guo¹⁷,
Wei (Wayne) Huang¹⁷,
Kai Huang¹⁸ &
…
Xiao Liu¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9545))

Included in the following conference series:

ICSH

2615 Accesses
2 Citations
1 Altmetric

Abstract

Social media is an important data source to compliment traditional epidemic surveillance. However, misinformation in social media hinders the exploitation of valuable information. Analysis of information credibility has drawn much attention of academia in recent years. In this paper, we focus on analyzing the credibility of influenza posts published on Sina Weibo. We propose a semi-supervised probabilistic graphical model to jointly learn the interactions between user trustworthiness, content reliability, and post credibility. To test the performance of the approach, we apply it to identify credible influenza posts published from May 2013 to June 2014 on Sina Weibo. Random Forests and the Bayesian Network are used as baselines for evaluation. The results show that our approach performs effectively with the highest average accuracy of 71.7 %, f-measure 51 %. Our proposed framework significantly outperformed the baselines in detecting credible influenza posts on Sina Weibo.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Al-Eidan, R., Al-Khalifa H., Al-Salman A.: Measuring the credibility of arabic text content in Twitter. In: 2010 Fifth International Conference on Digital Information Management (ICDIM), pp. 285–291. IEEE (2010)
Google Scholar
Yang, C.C., Yang, H., Jiang, L., Zhang, M.: Social media mining for drug safety signal detection. In: Proceedings of the 2012 International Workshop on Smart Health and Wellbeing, pp. 33–40. ACM (2012)
Google Scholar
Yang, H., Yang, C.C.: Harnessing social media for drug-drug interactions detection. In: 2013 IEEE International Conference on Healthcare Informatics (ICHI), pp. 22–29. IEEE (2013)
Google Scholar
Gupta, A., Kumaraguru, P.: Credibility ranking of tweets during high impact events. In: Proceedings of the 1st Workshop on Privacy and Security in Online Social Media, p. 2. ACM (2012)
Google Scholar
Yang, J., Counts, S., Morris, M.R., Hoff, A.: Microblog credibility perceptions: comparing the USA and China. In: Proceedings of the 2013 Conference on Computer Supported Cooperative Work, pp. 575–586. ACM (2013)
Google Scholar
AlMansour, A.A., Brankovic, L., Iliopoulos, C.S.: A model for recalibrating credibility in different contexts and languages-a Twitter case study. Int. J. Digital Inf. Wirel. Commun. (IJDIWC) 4(1), 53–62 (2014)
Google Scholar
Walter, Z.: Web credibility and stickiness of content web sites. In: International Conference on Wireless Communications, Networking and Mobile Computing, pp. 3820–3823. IEEE (2007)
Google Scholar
Juffinger, A., Granitzer, M., Lex, E.: Blog credibility ranking by exploiting verified content. In: Proceedings of the 3rd Workshop on Information Credibility on the Web, pp. 51–58. ACM (2009)
Google Scholar
Vydiswaran, V., Zhai, C., Roth, D.: Content-driven trust propagation framework. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 974–982. ACM (2011)
Google Scholar
Wanas, N., El-Saban, M., Ashour, H., Ammar, W.: Automatic scoring of online discussion posts. In: Proceedings of the 2nd ACM Workshop on information Credibility on the Web, pp. 19–26. ACM (2008)
Google Scholar
Benevenuto, F., Magno, G., Rodrigues, T., Almeida, V.: Detecting spammers on Twitter. In: Collaboration, Electronic Messaging, Anti-abuse and Spam Conference (CEAS), vol. 6, p. 12 (2010)
Google Scholar
Castillo, C., Mendoza, M., Poblete, B.: Information credibility on Twitter. In: Proceedings of the 20th International Conference on World Wide Web, pp. 675–684. ACM (2011)
Google Scholar
Qazvinian, V., Rosengren, E., Radev, D.R., Mei, Q.: Rumor has it: identifying misinformation in microblogs. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1589–1599. Association for Computational Linguistics (2011)
Google Scholar
Gupta, M., Zhao, P., Han, J.: Evaluating event credibility on Twitter. In: SDM, pp. 153–164. SIAM (2012)
Google Scholar
Pasternack, J., Roth, D.: Latent credibility analysis. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1009–1020. International World Wide Web Conferences Steering Committee (2013)
Google Scholar
Sondhi, P., Vydiswaran, V., Zhai, C.: Reliability prediction of webpages in the medical domain. In: Baeza-Yates, R., de Vries, A.P., Zaragoza, H., Cambazoglu, B., Murdock, V., Lempel, R., Silvestri, F. (eds.) ECIR 2012. LNCS, vol. 7224, pp. 219–231. Springer, Heidelberg (2012)
Chapter Google Scholar
Mukherjee, S., Weikum, G., Danescu-Niculescu-Mizil, C.: People on drugs: credibility of user statements in health communities. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 65–74. ACM (2014)
Google Scholar
Lafferty, J., McCallum, A., Pereira, F.C.: Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data (2001)
Google Scholar

Download references

Acknowledgements

We thank Jingwei Li, Qihui Xia, and Lidan Chen for the help with preprocessing and labeling data. We also show our great appreciation to professor Hsinchun Chen for the help with revising this paper. We finally would like to thank all the reviewers for their modification suggestions.

Author information

Authors and Affiliations

School of Management, Xi’an Jiaotong University, Xi’an, China
Qiaozhen Guo & Wei (Wayne) Huang
School of Computer Science, Fudan University, Shanghai, China
Kai Huang
Department of Management Information Systems, University of Arizona, Tucson, USA
Xiao Liu

Authors

Qiaozhen Guo
View author publications
You can also search for this author in PubMed Google Scholar
Wei (Wayne) Huang
View author publications
You can also search for this author in PubMed Google Scholar
Kai Huang
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qiaozhen Guo .

Editor information

Editors and Affiliations

Institute of Automation,Bldg.1004, Chinese Academy of Sciences, Beijing, China
Xiaolong Zheng
University of Arizona, Tucson, Arizona, USA
Daniel Dajun Zeng
University of Arizona, Phoenix, USA
Hsinchun Chen
Mayo Clinic, Scottsdale, USA
Scott J. Leischow

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Guo, Q., Huang, W.(., Huang, K., Liu, X. (2016). Information Credibility: A Probabilistic Graphical Model for Identifying Credible Influenza Posts on Social Media. In: Zheng, X., Zeng, D., Chen, H., Leischow, S. (eds) Smart Health. ICSH 2015. Lecture Notes in Computer Science(), vol 9545. Springer, Cham. https://doi.org/10.1007/978-3-319-29175-8_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-29175-8_12
Published: 20 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-29174-1
Online ISBN: 978-3-319-29175-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics