Clustering of Medical Publications for Evidence Based Medicine Summarisation

  • Sara Faisal Shash
  • Diego Mollá
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7885)


We present a study of the clustering properties of medical publications, with the aim of supporting Evidence Based Medicine summarisation. Given a dataset of documents that have been manually assigned to groups related to clinical answers, we apply K-Means clustering and verify that the documents can be clustered reasonably well. We discuss the implications of such clustering for natural language processing tasks in Evidence Based Medicine.
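The kind of experiment the abstract describes can be sketched roughly as follows. This is a minimal illustration only, not the authors' implementation: the term-frequency features, the purity measure, the function names, and the toy documents are all assumptions made for the example, since the abstract does not say which features or evaluation measures were used.

```python
import math
import random
from collections import Counter


def tf_vectors(docs):
    """Turn each document into a length-normalised term-frequency vector."""
    vocab = sorted({w for d in docs for w in d.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    vectors = []
    for d in docs:
        vec = [0.0] * len(vocab)
        for w in d.lower().split():
            vec[index[w]] += 1.0
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vectors


def kmeans(vectors, k, iterations=20, seed=0):
    """Plain Lloyd's algorithm; returns one cluster label per vector."""
    rng = random.Random(seed)
    # Initial centroids are k randomly sampled input vectors (an assumption).
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(v, centroids[c])),
            )
            for v in vectors
        ]
        # Update step: each non-empty cluster's centroid becomes its mean.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def purity(labels, gold):
    """Fraction of documents that fall in their cluster's majority gold group."""
    total = 0
    for c in set(labels):
        groups = Counter(g for lab, g in zip(labels, gold) if lab == c)
        total += groups.most_common(1)[0][1]
    return total / len(labels)


# Hypothetical toy data: four short "abstracts" with manual group labels.
docs = [
    "aspirin reduces headache pain",
    "aspirin headache trial",
    "statin lowers cholesterol",
    "statin cholesterol study",
]
gold = ["headache", "headache", "cholesterol", "cholesterol"]

labels = kmeans(tf_vectors(docs), k=2)
print("cluster labels:", labels)
print("purity against manual groups:", purity(labels, gold))
```

Comparing the cluster labels with the manual groups, here through purity, is one simple way to check whether documents "can be clustered reasonably well". The result of K-Means depends on the initial centroids, so in practice one would run it with several seeds.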


Keywords: Evidence Base Medicine; Clinical Question; Semantic Type; Document Cluster; Cover Method
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Sara Faisal Shash (1)
  • Diego Mollá (1)
  1. Department of Computing, Macquarie University, Sydney, Australia
