Abstract
During emerging disease outbreaks, massive information are disseminated through social network. In China, Sina microblog system as the biggest social network provide a novel way to monitoring the development of emerging disease and public awareness. However, only a small percentage of microblogs could wide spread. Therefore, predict popularity of microblogs timely are meaningful for emergency management. In this paper, a Judgment method for popularity level prediction of microblog is proposed and the temporal pattern between cases number and repost number is verified. Repost number is considered to measure the impact of microblogs. To predict the popularity of microblogs, Granger causality test was used to verify the temporal correlation pattern between development of disease and public concern while an Judgment method based on five classical classification models were proposed. Through analyses, case number of emerging disease are Granger causality of the popularity level of microblogs and the regression model got the best result when lag was three. By Judgment method, more than 86 % microblogs can be classified correctly. The proposed Judgment method based on user, microblog and emerging disease information could analysis the popularity level of microblogs speedily and accurately. This is important and meaningful for monitoring the development of future public health event.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Brownstein, J.S., Freifeld, C.C., Mado, L.C.: Digital disease detection-harnessing the web for public health surveillance. New Engl. J. Med. 360, 2153–2157 (2009)
Wilson, K., Brownstein, J.S.: Early detection of disease outbreaks using the internet. Can. Med. Assoc. J. 180, 829–831 (2009)
Lampos, V., De Bie, T., Cristianini, N.: Flu detector - tracking epidemics on twitter. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds.) ECML PKDD 2010, Part III. LNCS, vol. 6323, pp. 599–602. Springer, Heidelberg (2010)
The global public health intelligence network (GPHIN). http://www.who.int/csr/alertresponse/epidemicintelligence/en/
The medical information system (medisys). http://medusa.jrc.it/medisys/
Ginsberg, J., Mohebbi, M.H., Patel, R.S., Brammer, L., Smolinski, M.S., et al.: Detecting influenza epidemics using search engine query data. Nature 457, 1012–1014 (2008)
Yuan, Q., Nsoesie, E.O., Lv, B., Peng, G., Chunara, R., et al.: Monitoring influenza epidemics in china with search query from baidu. PLoS ONE 8, e64323 (2013)
Naveed, N., Gottron, T., Kunegis, J., Alhadi, A.C.: Bad news travel fast: A content-based analysis of interestingness on twitter. In: Proceedings of the 3rd International Web Science Conference (2011)
Salathé, M., Freifeld, C.C., Mekaru, S.R., Tomasulo, A.F., Brownstein, J.S.: Influenza A (H7N9) and the importance of digital epidemiology. New Engl. J. Med. 369, 401–404 (2013)
Yang, J., Counts, S.: Comparing information diffusion structure in weblogs and microblogs. In: ICWSM (2010)
Fernandez-Luque, L., Karlsen, R., Bonander, J.: Review of extracting information from the social web for health personalization. J. Med. Internet Res. 13, e15 (2011)
Kostkova, P.: A roadmap to integrated digital public health surveillance: the vision and the challenges. In: Proceedings of the 22nd International Conference on World Wide Web Companion, International World Wide Web Conferences Steering Committee, pp. 687–694 (2013)
Yang, J., Counts, S.: Predicting the speed, scale, and range of information diffusion in twitter. ICWSM 10, 355–358 (2010)
Hong, L., Dan, O., Davison, B.D.: Predicting popular messages in twitter. In: Proceedings of the 20th International Conference Companion on World Wide Web, pp. 57–58. ACM (2011)
Jenders, M., Kasneci, G., Naumann, F.: Analyzing and predicting viral tweets. In: Proceedings of the 22nd International Conference on World Wide Web Companion, International World Wide Web Conferences Steering Committee, pp. 657–664 (2013)
Starbird, K., Palen, L.: (how) will the revolution be retweeted? information diffusion and the 2011 egyptian uprising. In: Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work, pp. 7–16. ACM (2012)
Chew, C., Eysenbach, G.: Pandemics in the age of twitter: content analysis of tweets during the 2009 H1N1 outbreak. PLoS ONE 5, e14118 (2010)
Yang, J., Leskovec, J.: Patterns of temporal variation in online media. In: Proceedings of the Fourth ACM INTERNATIONAL Conference on Web Search and Data Mining, pp. 177–186. ACM (2011)
Lee, C., Kwak, H., Park, H., Moon, S.: Finding influentials based on the temporal order of information adoption in twitter. In: Proceedings of the 19th International Conference on World Wide Web, pp. 1137–1138. ACM (2010)
Hay, S.I., George, D.B., Moyes, C.L., Brownstein, J.S.: Big data opportunities for global infectious disease surveillance. PLoS medicine 10, e1001413 (2013)
Paul, M.J., Dredze, M.: You are what you tweet: Analyzing twitter for public health. In: ICWSM (2011)
Kwak, H., Lee, C., Park, H., Moon, S.: What is twitter, a social network or a news media? In: Proceedings of the 19th International Conference on World Wide Web, pp. 591–600. ACM (2010)
Signorini, A., Segre, A.M., Polgreen, P.M.: The use of twitter to track levels of disease activity and public concern in the us during the influenza A H1N1 pandemic. PLoS ONE 6, e19467 (2011)
Collier, N., Son, N.T., Ngoc, M.N.T.: Omg u got u? analysis of shared health messages for bio-surveillance. In: Semantic Mining in Biomedicine (2010)
Chunara, R., Andrews, J.R., Brownstein, J.S., et al.: Social and news media enable estimation of epidemiological patterns early in the 2010 haitian cholera outbreak. Am. J. Trop. Med. Hyg. 86, 39 (2012)
Human infection with influenza A(H7N9) virus in china. http://www.who.int
Human infection with influenza A(H7N9) virus in china. http://www.chinapop.gov.cn/yjb/s3578/201305/67d505cd37eb4a419f17518bdbe05b54.shtml
Guo, Z., Li, Z., Tu, H.: Sina microblog: an information-driven online social network. In: 2011 International Conference on Cyberworlds (CW), pp. 160–167. IEEE (2011)
The hot index of hotword in sina microblog system. http://data.weibo.com/index/hotword
The registered users of sina microblog system exceeded 300 million and more than 100 million microblogs a day. http://news.xinhuanet.com/tech/2012-02/29/c_122769084.htm
Granger, C.W.: Investigating causal relations by econometric models and cross-spectral methods. Econom. J. Econom. Soc. 37, 424–438 (1969)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., et al.: The weka data mining software: an update. ACM SIGKDD Explor. Newsl. 11, 10–18 (2009)
Bishop, C.M., et al.: Pattern Recognition and Machine Learning, vol. 1. Springer, New York (2006)
Lim, T.S., Loh, W.Y., Shih, Y.S.: A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms. Mach. Learn. 40, 203–228 (2000)
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques. Morgan kaufmann, San Francisco (2006)
Keerthi, S.S., Shevade, S.K., Bhattacharyya, C., Murthy, K.R.K.: Improvements to platt’s SMO algorithm for SVM classifier design. Neural Comput. 13, 637–649 (2001)
Margineantu, D.D., Dietterich, T.G.: Pruning adaptive boosting. In: ICML. Citeseer, volume 97, pp. 211–218 (1997)
Barboza, P., Vaillant, L., Mawudeku, A., Nelson, N.P., Hartley, D.M., et al.: Evaluation of epidemic intelligence systems integrated in the early alerting and reporting project for the detection of A/H5N1 influenza events. PLoS ONE 8, e57252 (2013)
Acknowledgments
This study was funded by National Natural Science Foundation of China (Nos.90924302, 91224008, 91024030, 91324007) and Important National Science & Technology Specific Projects (Nos.2012ZX10004801, 2013ZX10004218).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Liu, J., Cao, Z., Zeng, D. (2014). Predicting Popularity of Microblogs in Emerging Disease Event. In: Chen, Y., et al. Web-Age Information Management. WAIM 2014. Lecture Notes in Computer Science(), vol 8597. Springer, Cham. https://doi.org/10.1007/978-3-319-11538-2_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-11538-2_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11537-5
Online ISBN: 978-3-319-11538-2
eBook Packages: Computer ScienceComputer Science (R0)