Mining Twitter Data for Potential Drug Effects
Adverse drug reactions have become one of the top causes of deaths. For surveillance of adverse drug events, patients have gradually become involved in reporting their experiences with medications through the use of dedicated and structured systems. The emerging of social networking provides a way for patients to describe their drug experiences online in less-structured free text format. We developed a computational approach that collects, processes and analyzes Twitter data for drug effects. Our approach uses a machine-learning-based classifier to classify personal experience tweets, and use NLM’s MetaMap software to recognize and extract word phrases that belong to drug effects. Our results on 5 medications demonstrate the validity of our approach, and its ability to correctly extract potential drug effects from the Twitter data. It is conceivable that social media data can serve to complement and/or supplement traditional time-consuming and costly surveillance methods.
KeywordsBig Data Drug Effect Identification Natural Language Processing Pharmacovigilance Social Media Mining Twitter
Unable to display preview. Download preview PDF.
- 1.World Health Organization. Pharmacovigilance: ensuring the safe use of medicines. Geneva, Switzerland: World Health Organization (2004)Google Scholar
- 7.Levinson, D.R., General, I.: Adverse events in hospitals: national incidence among Medicare beneficiaries. Department of Health & Human Services (2010)Google Scholar
- 12.Stone, B.: https://blog.twitter.com/2009/whats-happening2009 (cited 2013)
- 13.Signorini, A., Segre, A.M., Polgreen, P.: The use of Twitter to track levels of disease activity and public concern in the U.S. during the influenza A H1N1 pandemic. PLoS One 6(5), e19467 (2011)Google Scholar
- 15.Corley, C.D., Mikler, A.R., Singh, K.P., Cook, D.J.: Monitoring influenza trends through mining social media. In: International Conference on Bioinformatics & Computational Biology (2009)Google Scholar
- 17.Leaman, R., Wojtulewicz, L., Sullivan, R., Skariah, A., Yang, J., Gonzalez, G.: Towards internet-age pharmacovigilance: extracting adverse drug reactions from user posts to health-related social networks. In: Proceedings of the 2010 Workshop on Biomedical Natural Language Processing. Association for Computational Linguistics (2010)Google Scholar
- 18.Verma, S., Vieweg, S., Corvey, W.J., Palen, L., Martin, J.H., Palmer, M., et al.: Natural Language Processing to the Rescue?: Extracting ‘Situational Awareness’ Tweets During Mass Emergency. In: Proc ICWSM (2011)Google Scholar
- 19.Elgersma, E., de Rijke, M.: Personal vs non-personal blogs: Initial classification experiments. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM (2008)Google Scholar
- 20.Li, S., Huang, C.-R., Zhou, G., Lee, S.: Employing personal/impersonal views in supervised and semi-supervised sentiment classification. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics (2010)Google Scholar
- 21.Bird, S.: NLTK: The natural language toolkit. In: Proceedings of the COLING/ACL on Interactive Presentation Sessions. Association for Computational Linguistics (2006)Google Scholar
- 22.Aronson, A.R., Lang, F.M.: An overview of MetaMap: historical perspective and recent advances. J. Am. Med. Inform. Assoc. 17(3), 229–236 (2010)Google Scholar
- 23.Bian, J., Topaloglu, U., Yu, F.: Towards large-scale twitter mining for drug-related adverse events. In: Proceedings of the 2012 International Workshop on Smart Health and Wellbeing, pp. 25–32. ACM (2012)Google Scholar