Skip to main content

An Active Ingredients Entity Recogniser System Based on Profiles

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9612))

Abstract

This paper describes an active ingredients named entity recogniser. Our machine learning system, which is language and domain independent, employs unsupervised feature generation and weighting from the training data. The proposed automatic feature extraction process is based on generating a profile for the given entity without traditional knowledge resources (such as dictionaries). Our results (F1 87.3 % [95 %CI: 82.07–92.53]) proves that unsupervised feature generation can achieve a high performance for this task.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Björne, J., Kaewphan, S., Salakoski, T.: UTurku: drug named entity recognition and drug-drug interaction extraction using svm classification and domain knowledge. In: Proceedings of the Seventh International Workshop on Semantic Evaluation, vol. 2, pp. 651–659 (2013)

    Google Scholar 

  2. Collazo, A., Ceballo, A., Puig, D.D., Gutiérrez, Y., Abreu, J.I., Pérez, R., Fernández Orqín, A., Montoyo, A., Muñoz, R., Camara, F.: UMCC_DLSI: Semantic and Lexical features for detection and classification Drugs in biomedical texts. In: Proceedings of the Seventh International Workshop on Semantic Evaluation, vol. 2, pp. 636–643 (2013). http://www.aclweb.org/anthology/S13-2106

  3. Deléger, L., Grouin, C., Zweigenbaum, P.: Extracting medical information from narrative patient records: the case of medication-related information. J. Am. Med. Inform. Assoc. 17(5), 555–558 (2010)

    Article  Google Scholar 

  4. Doan, S., Bastarache, L., Klimkowski, S., Denny, J.C., Xu, H.: Integrating existing natural language processing tools for medication extraction from discharge summaries. J. Am. Med. Inform. Assoc. 17(5), 528–531 (2010)

    Article  Google Scholar 

  5. Ely, J.W., Osheroff, J.A., Ebell, M.H., Bergus, G.R., Levy, B.T., Chambliss, M., Evans, E.R.: Analysis of questions asked by family doctors regarding patient care. Br. Med. J. 319, 358–361 (1999)

    Article  Google Scholar 

  6. Feldman, R., Sanger, J.: The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data. Cambridge University Press, New York (2007)

    Google Scholar 

  7. Freund, Y., Schapire, R.E.: Large margin classification using the perceptron algorithm. Mach. Learn. 37(3), 277–296 (1999)

    Article  MATH  Google Scholar 

  8. Friedman, C., Rindflesch, T.C., Corn, M.: Natural language processing: state of the art and prospects for significant progress, a workshop sponsored by the National Library of Medicine. J. Biomed. Inform. 46(5), 765–773 (2013)

    Article  Google Scholar 

  9. Gonzalez-Gonzalez, A.I., Dawes, M., Sanchez-Mateos, J., Riesgo-Fuertes, R., Escortell-Mayor, E., Sanz-Cuesta, T., Hernandez-Fernandez, T.: Information needs and information-seeking behavior of primary care physicians. Ann. Family Med. 5, 345–352 (2007)

    Article  Google Scholar 

  10. González-González, A.I., Sánchez Mateos, J., Sanz Cuesta, T., Riesgo Fuertes, R., Escortell Mayor, E., Hernández Fernández, T.: Estudio de las necesidades de información generadas por los médicos de atención primaria (proyecto ENIGMA)*. Atención primaria, 38(4), 219–224 (2006)

    Google Scholar 

  11. Grego, T., Couto, F.M.: LASIGE : using Conditional Random Fields and ChEBI ontology. In: Proceedings of the Seventh International Workshop on Semantic Evaluation, vol. 2, pp. 660–666 (2013). http://aclweb.org/anthology/S13-2109

  12. Hamon, T., Grabar, N.: Linguistic approach for identification of medication names and related information in clinical narratives. J. Am. Med. Inform. Assoc. 17(5), 549–554 (2010)

    Article  Google Scholar 

  13. Huber, T., Linden, U.D., Rockt, T.: WBI-NER : The impact of domain-specific features on the performance of identifying and classifying mentions of drugs. In: Proceedings of the Seventh International Workshop on Semantic Evaluation, vol. 2, pp. 356–363 (2013). http://www.aclweb.org/anthology/S13-2058

  14. Lopes, L., Vieira, R.: Building and applying profiles through term extraction. In: X Brazilian Symposium in Information and Human Language Technology, Natal, Brazil, pp. 91–100 (2015)

    Google Scholar 

  15. Moreno, I., Moreda, P., Romá-Ferri, M.: Reconocimiento de entidades nombradas en dominios restringidos. In: Actas del III Workshop en Tecnologías de la Informática, Alicante, Spain, pp. 41–57 (2012)

    Google Scholar 

  16. Moreno, I., Moreda, P., Romá-Ferri, M.T.: MaNER: a MedicAl named entity recogniser. In: Biemann, C., Handschuh, S., Freitas, A., Meziane, F., Métais, E. (eds.) NLDB 2015. LNCS, vol. 9103, pp. 418–423. Springer, Heidelberg (2015)

    Chapter  Google Scholar 

  17. Padró, L., Stanilovsky, E.: FreeLing 3.0: towards wider multilinguality. In: Proceedings of the Language Resources and Evaluation Conference, Istanbul, Turkey (2012)

    Google Scholar 

  18. Patrick, J., Li, M.: High accuracy information extraction of medication information from clinical notes: 2009 i2b2 medication extraction challenge. J. Am. Med. Inform. Assoc. 17(5), 524–527 (2010)

    Article  Google Scholar 

  19. Platt, J.: Using analytic QP and sparseness to speed training of support vector machines. In: Proceedings of the Advances in Neural Information Processing Systems, vol. 11, pp. 557–563 (1999)

    Google Scholar 

  20. Sanchez-Cisneros, D., Aparicio Gali, F.: UEM-UC3M: An Ontology-based named entity recognition system for biomedical texts. In: Proceedings of the Seventh International Workshop on Semantic Evaluation, vol. 2, pp. 622–627 (2013)

    Google Scholar 

  21. Segura-Bedmar, I., Revert, R., Martínez, P.: Detecting drugs and adverse events from Spanish health social media streams. In: Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis, pp. 106–115 (2014)

    Google Scholar 

  22. Singh, A.: Natural language processing for less privileged languages: Where do we come from? Where are we going? In: IJCNLP-08 Workshop on NLP for Less Privileged Languages, Hyderabad, India, pp. 7–12 (2008)

    Google Scholar 

  23. Tikk, D., Solt, I.: Improving textual medication extraction using combined conditional random fields and rule-based systems. J. Am. Med. Inform. Assoc. 17(5), 540–544 (2010)

    Article  Google Scholar 

  24. Yang, H.: Automatic extraction of medication information from medical discharge summaries. J. Am. Med. Inform. Assoc. 17(5), 545–548 (2010). http://dx.doi.org/10.1136/jamia.2010.003863

    Article  Google Scholar 

Download references

Acknowledgments

This paper has been partially supported by the Spanish Government (grant no. TIN2015-65100-R).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Isabel Moreno .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Moreno, I., Moreda, P., Romá-Ferri, M.T. (2016). An Active Ingredients Entity Recogniser System Based on Profiles. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds) Natural Language Processing and Information Systems. NLDB 2016. Lecture Notes in Computer Science(), vol 9612. Springer, Cham. https://doi.org/10.1007/978-3-319-41754-7_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-41754-7_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-41753-0

  • Online ISBN: 978-3-319-41754-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics