Skip to main content

CIE-9-MC Code Classification with knn and SVM

  • Conference paper
Bioinspired Applications in Artificial and Natural Computation (IWINAC 2009)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5602))

Abstract

This paper is concerned with automatic classification of texts in a medical domain. The process consists in classifying reports of medical discharges into classes defined by the CIE-9-MC codes. We will assign CIE-9-MC codes to reports using either a knn model or support vector machines. One of the added values of this work is the construction of the collection using the discharge reports of a medical service. This is a difficult collection because of the high number of classes and the uneven balance between classes. In this work we study different representations of the collection, different classication models, and different weighting schemes to assign CIE-9-MC codes. Our use of document expansion is particularly novel: the training documents are expanded with the descriptions of the assigned codes taken from CIE-9-MC. We also apply SVMs to produce a ranking of classes for each test document. This innovative use of SVM offers good results in such a complicated domain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aas, K., Eikvil, L.: Text categorisation: A survey. Technical report, Norwegian Computing Center (1999)

    Google Scholar 

  2. Hsu, C.W., Lin, C.J.: A comparison of methods for multiclass support vector machines. IEEE Transactions on Neural Networks 13(2) (2002)

    Google Scholar 

  3. Joachims, T.: Making large-scale svm learning practical. In: Advances in Kernel Methods - Support Vector Learning. MIT press, Cambridge (1999)

    Google Scholar 

  4. Larkey, L., Croft, W.B.: Automatic assignment of cie-9 codes to discharge summaries. Technical report, CIIR (1995)

    Google Scholar 

  5. Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1–47 (2002)

    Article  Google Scholar 

  6. Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)

    Book  MATH  Google Scholar 

  7. Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proc. SIGIR 1999, the 22nd ACM Conference on Research and Development in Information Retrieval, Berkeley, USA, August 1999, pp. 42–49 (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lojo, D., Losada, D.E., Barreiro, Á. (2009). CIE-9-MC Code Classification with knn and SVM. In: Mira, J., Ferrández, J.M., Álvarez, J.R., de la Paz, F., Toledo, F.J. (eds) Bioinspired Applications in Artificial and Natural Computation. IWINAC 2009. Lecture Notes in Computer Science, vol 5602. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02267-8_53

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02267-8_53

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02266-1

  • Online ISBN: 978-3-642-02267-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics