Epidemic Outbreak and Spread Detection System Based on Twitter Data

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7231)

Abstract

Social network systems such as Twitter can serve as important data sources, providing collective intelligence and awareness of health problems in real time. Using social media data is challenging because the data are voluminous, distributed, and highly unstructured. They must be gathered, scrubbed, and aggregated before they can be put to meaningful use. In this paper, we discuss a social media data ETL (Extract-Transform-Load) method that provides a user-friendly, dynamic way to visualize outbreaks and the spread of developing epidemics in space and time. We have developed the Epidemics Outbreak and Spread Detection System (EOSDS) as a prototype that makes use of the rich information retrievable in real time from Twitter. EOSDS provides three visualization methods for spreading epidemics (static map, distribution map, and filter map) to investigate public health threats in the space and time dimensions. In our experiments, the results of these visualizations correlate well with the relevant official CDC reports, a gold standard used by health informatics scientists. The EOSDS also detected an unusual situation not shown in the CDC reports but confirmed by online news media.
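The abstract does not give implementation details, so the following is only a minimal sketch of what an extract-transform-load pass over tweets might look like, not the EOSDS implementation. The record fields (`text`, `location`, `day`), the toy `GAZETTEER` lookup, and the function names are all assumptions made for illustration. Extraction filters by keyword, transformation resolves a free-text location to a state, and loading aggregates counts per day and state, which is the kind of table a map or timeline view could consume.

```python
from collections import Counter
from datetime import date

# Hypothetical toy gazetteer: lowercase place name -> US state code.
# A real system would need a far larger gazetteer or a geocoding service.
GAZETTEER = {"denver": "CO", "cheyenne": "WY", "austin": "TX"}


def extract(tweets, keyword):
    """Keep tweets whose text mentions the keyword (case-insensitive)."""
    kw = keyword.lower()
    return [t for t in tweets if kw in t["text"].lower()]


def transform(tweets):
    """Resolve each tweet's free-text location to a state code.

    Tweets whose location is missing or not in the gazetteer are dropped.
    """
    rows = []
    for t in tweets:
        place = t.get("location", "").strip().lower()
        state = GAZETTEER.get(place)
        if state is not None:
            rows.append((t["day"], state))
    return rows


def load(rows):
    """Aggregate into counts keyed by (day, state)."""
    return Counter(rows)


# Made-up sample records, for illustration only.
sample = [
    {"text": "Listeria scare from cantaloupes", "location": "Denver",
     "day": date(2011, 9, 30)},
    {"text": "worried about listeria", "location": " denver ",
     "day": date(2011, 9, 30)},
    {"text": "Nice weather today", "location": "Austin",
     "day": date(2011, 9, 30)},
    {"text": "LISTERIA case reported", "location": "Cheyenne",
     "day": date(2011, 10, 7)},
    {"text": "listeria news", "location": "somewhere over the rainbow",
     "day": date(2011, 10, 7)},
]

counts = load(transform(extract(sample, "listeria")))
for (day, state), n in sorted(counts.items()):
    print(day.isoformat(), state, n)
```

Keeping the three stages as separate functions means the location-resolution step, which is likely the noisiest part of such a pipeline, can be swapped or tested on its own.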

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ji, X., Chun, S.A., Geller, J. (2012). Epidemic Outbreak and Spread Detection System Based on Twitter Data. In: He, J., Liu, X., Krupinski, E.A., Xu, G. (eds) Health Information Science. HIS 2012. Lecture Notes in Computer Science, vol 7231. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29361-0_19

  • DOI: https://doi.org/10.1007/978-3-642-29361-0_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-29360-3

  • Online ISBN: 978-3-642-29361-0

  • eBook Packages: Computer Science, Computer Science (R0)
