Abstract
Medical social-media data provides a wealth of data generated by both healthcare professionals and patients alike. In fact, there are many medical social-media sites such as forums, where patients freely dialog with a healthcare professional or with other patients, often posing questions and responding to advice, or Weblogs, where groups of people describe their experiences with medical conditions and the various treatment plans to treat those conditions. All in all, one can no longer ignore the fact that social media has dramatically changed the structure of healthcare delivery in many ways. Simply from a medical data standpoint alone, social-media platforms have altered the way medical information is disseminated. That is, important medical information is no longer found exclusively in patients’ clinical narratives, commonly shared by physicians and other healthcare workers at regular professional meetings and conferences. Instead, user-generated content on the Web has become a new source of useful information to be added to the conventional methods of collecting clinical data. The challenge we face, however, is to design information extraction tools that can make the rich resources of medical data found in social-media postings exploitable. In this chapter we analyze the linguistic features of medical social-media postings juxtaposed to the linguistic features of both clinical narratives (e.g., discharge summaries, chart reviews, and operative reports) and biomedical literature, for which there already exists tools for performing information extraction. We show the shortcomings of these mapping tools when applied to medical social-media postings, and propose ways to improve such tools so that the wealth of medical data located in medical social-media can be made available to healthcare providers, pharmaceutical companies, and government-supported epidemiological agencies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
While younger populations were fast in adopting these new technologies, the number of older adults using social media is also growing fast.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
e.g., http://clinicalcases.org
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
http://www.thisisms.com/forum/daily-life-f35/topic20839.html (Section "Daily Life").
- 14.
'http://rssfeeds.webmd.com/rss/rss.aspx?RSSSource = RSS_PUBLIC'.
- 15.
- 16.
http:www.biolabeler.com
- 17.
- 18.
- 19.
- 20.
https://wiki.nci.nih.gov/display/VKC/cTAKES + 2.5.
References
Aase L, Goldman D, Gould M, Noseworthy J, Timimi F (2012) Bringing the Social-media Revolution to Health Care. Mayo Foundation for Medical Education & Research, United States, 2012
Altarum Institute (2012) Social-media and Health Care: Applications for Aging and Advanced Illness Populations. Highlights from Duke University’s 07–08 May 2012, Durham, U.S., http://www.dukehsac.com/files/2012/09/CHAPI-Social-Media-and-Health-Care-Paper-1.pdf [downloaded October 25, 2012]
Aronson A (2001) Effective mapping of biomedical text to the UMLS metathesaurus: the metamap program. Proc AMIA Symp 2001:17–21
Aronson AR, Bodenreider O, Demner-Fushman D, Fug KW, Lee VK, Mork JG, Névéol A, Peters L, Roger WJ (2007) From indexing the biomedical literature to coding clinical text: experience with MIT and machine learning approaches. ACL, Workshop BioNLP, Prague, Czech Republic
Barla M, Bielikova M (2010) Ordinary web pages as a source for metadata acquisition for open corpus user modeling. In: White B, IsaĂas P, Andone D (eds.), Proceedings of the IADIS International Conference on WWW/Internet. (Timisoara, Romania). IADIS, 2010, pp 227–233
Boulos MNK, Maramba I, Wheeler S (2006) Wikis, blogs and podcasts: a new generation of web-based tools for virtual collaborative clinical practice and education. BMC Med Educ 6:41
Chapman WW, Fiszman M, Dowling JN, Chapman BE, Rindflesch TC (2004) Identifying respiratory findings in emergency department reports for biosurveillance using metamap. Stud Health Technol Inform 107:487–491
Cohen AM, Hersh WR (2005) A survey of current cork in biomedical text mining. Brief Bioinform 6(1):57–71
Denecke K (2012) An architecture for diversity-aware search for medical web content. Methods Inf Med 51(6):549–556
Denecke K, Dolog P, Smrz P (2012) Making use of social-media data in public health. In: Alain Mille et al (eds) Proceedings of the 21st World wide web conference, WWW 2012, Lyon, France, 16–20 April 2012, pp 243–246
Etzioni O, Fader A, Christensen J, Soderland S (2011) Open information extraction: the second generation, mausam. International joint conference on artificial intelligence, 2011, Barcelona, Catalonia, Spain
Friedman C, Kra P, Rzhetsky A (2002) Two biomedical sublanguages: a description based on the theories of Zellig Harris. J Biomed Inform 35:222–235
Grishman R (1998) Information extraction and speech recognition. In: Proceedings of the broadcast news transcription and understanding workshop, Lansdowne, VA, February 1998
Hillan J (2003) Physician use of patient-centered weblogs and online journals. Clin Med Res 1(4):333–335
Himmel W, Reincke U, Michelmann HW (2008) Using text mining to classify lay requests to a medical expert forum and to prepare semiautomatic answers, SAS global forum, San Antonio, TX
Jonquet C, Shah NH, Musen MA (2009) The open biomedical annotator. Summit on Translat Bioinform 2009:56–60
Kahn CEJ, Rubin DL (2009) Automated semantic indexing of figure captions to improve radiology image retrieval. J Am Med Inform Assoc 16:280–286
Kovic I, Lulic I, Brumini G (2008) Examining the medical blogosphere: an online survey of medical bloggers. J Med Internet Res 10(3):e28
McCray AT (2003) An upper level ontology for the biomedical domain. Comp Funct Genomics 4:80–84
McCray AT, Burgun A, Bodenreider O (2001) Aggregating UMLS semantic types for reducing conceptual complexity. Medinfo 10(1):216–220
Meystre SM, Savova GK, Kipper-Schuler KC, Hurdle JF (2008) Extracting information from textual documents in the electronic health record: a review of recent research. Yearb Med Inform 2008:128–144
Miller EA, Pole A (2010) Diagnosis blog: checking up on health blogs in the blogosphere. Am J Public Health 100(8):1514–1519
Rizzo G, Troncy R (2012) NERD: a framework for unifying named entity recognition and disambiguation web extraction tools. System demonstration at the 13th conference of the European chapter of the association for computational linguistics (EACL’2012), Avignon, France, 23–27 April 2012
Stewart SA, von Maltzahn ME, Raza Abidi SS (2012) Comparing metamap to mgrep as a tool for mapping free text to formal medical lexions. In: Proceedings of the 1st international workshop on knowledge extraction & consolidation from social-media in conjunction with the 11th international semantic web conference (ISWC 2012), Boston, USA, 12 November 2012, pp 63–77
Zeng QT, Tse T (2006) Exploring and developing consumer health vocabularies. J Am Med Inform Assoc 13(1):24–29
Zeng QT, Tse T, Divita G et al (2007) Term Identification methods for consumer health vocabulary development. J Med Internet Res 9(1):e4
Zhou X, Zhang X, Hu X. Dragon toolkit: incorporating auto-learned semantic knowledge into large-scale text retrieval and mining. In: Proceedings of the 19th IEEE international conference on tools with artificial intelligence (ICTAI), Patras, Greece, 29–31 October 2007
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media New York
About this chapter
Cite this chapter
Denecke, K., Soltani, N. (2013). The Burgeoning of Medical Social-Media Postings and the Need for Improved Natural Language Mapping Tools. In: Neustein, A., Markowitz, J. (eds) Where Humans Meet Machines. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6934-6_2
Download citation
DOI: https://doi.org/10.1007/978-1-4614-6934-6_2
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-6933-9
Online ISBN: 978-1-4614-6934-6
eBook Packages: EngineeringEngineering (R0)