Skip to main content

Text Mining for Medical Documents Using a Hidden Markov Model

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4182))

Abstract

We propose a semantic tagger that provides high level concept information for phrases in clinical documents. It delineates such information from the statements written by doctors in patient records. The tagging, based on Hidden Markov Model (HMM), is performed on the documents that have been tagged with Unified Medical Language System (UMLS), Part-of-Speech (POS), and abbreviation tags. The result can be used to extract clinical knowledge that can support decision making or quality assurance of medical treatment.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rabiner, L.R., et al.: An Introduction to Hidden Markov Models. IEEE ASSP Magazine (1986)

    Google Scholar 

  2. van Guilder, L.: Automated Part of Speech Tagging:A Brief Overview. In: Handout for LING, vol. 361 (1995)

    Google Scholar 

  3. Kupiec, J.: Robust part-of-speech tagging using a hidden Markov model. In: Computer Speech and Language, pp. 225–242 (1992)

    Google Scholar 

  4. Cutting, D., et al.: A Practical Part-of-Speech Tagger. In: Proceedings of the 3rd ACL, pp. 133–140 (1992)

    Google Scholar 

  5. Ruch, P.: MEDTAG: Tag-like Semantics for Medical Document Indexing. In: Proceedings of AMIA 1999, pp. 35–42 (1999)

    Google Scholar 

  6. Johnson, S.B.: A Semantic Lexicon for Medical Language Processing. JAMIA 6(3), 205–218 (1999)

    Google Scholar 

  7. Hahn, U.: Tagging Medical Documents with High Accuracy. In: Pacific Rim International Conference on Artificial Intelligence Auckland, Newzealand, pp. 852–861 (2004)

    Google Scholar 

  8. Paulussen, H.: DILEMMA-2: A Lemmatizer-Tagger for Medical Abstracts. In: Proceeings of ANLP, pp. 141–146 (1992)

    Google Scholar 

  9. Friedman, C.: Automatic Structuring of Sublanguage Information, pp. 85–102. IEA, London (1986)

    Google Scholar 

  10. Chi, E.C., et al.: Processing Free-text Input to Obtain a Database of Medical Information. In: Proceedings of the 8th Annual ACM-SIGIR Conference (1985)

    Google Scholar 

  11. Hahn, U.: Automatic Knowledge Acquisition from Medical Texts. In: Proceedings of the 1996 AMIA Annual Fall Symposium, pp. 383–387 (1996)

    Google Scholar 

  12. What is CDA?: http://www.h17.org.au/CDA.htm#CDA

  13. Elworthy, D.: Does Baum-Welch Re-estimation Help Taggers? In: Proceedings of the 27th ACL (1989)

    Google Scholar 

  14. Merialdo, B.: Tagging English Text with a Probabilistic Model. Computational Linguistics 20(2), 155–172 (1994)

    Google Scholar 

  15. Viterbi, A.J.: Error bounds for convolutional codes and an asymptotically optimal decoding algorithm. IEEE Transactions of Information Theory 13, 260–269 (1967)

    Article  MATH  Google Scholar 

  16. Baum, L.: An inequality and associated maximization technique in statistical estimation for probabilistic functions of a Markov process. Inequalities 3, 1–8 (1972)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jang, H., Song, S.K., Myaeng, S.H. (2006). Text Mining for Medical Documents Using a Hidden Markov Model. In: Ng, H.T., Leong, MK., Kan, MY., Ji, D. (eds) Information Retrieval Technology. AIRS 2006. Lecture Notes in Computer Science, vol 4182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11880592_45

Download citation

  • DOI: https://doi.org/10.1007/11880592_45

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-45780-0

  • Online ISBN: 978-3-540-46237-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics