Skip to main content

Discovery, Enrichment and Disambiguation of Acronyms

  • Conference paper
  • First Online:
  • 1218 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9829))

Abstract

Acronym disambiguation is the process of linking an acronym in a given text to its intended expansion in the text. Acronyms are frequently used in short-texts such as news headlines and tweets. The direct application of state-of-art named entity disambiguation approaches on short text results in poor performance since, entities are not associated with their acronyms in the Knowledge Bases. Also, many acronyms in short-text represent out of Knowledge Base entities. Existing acronym dictionaries such as Acronymfinder also cannot be used for disambiguation as contextual information requires for disambiguation is absent in them. In this paper, we propose a system for effective disambiguation acronyms in short-text. In particular, we built an Acronym dictionary that is automatically updated with new acronyms by continuous monitoring of news media. Each acronym in our Acronym dictionary is enriched with additional meta information comprised of category, location and context words extracted from news articles. We use our enriched Acronym dictionary for disambiguation of acronyms in short-texts. Experimental results shows that our system is efficient in discovery and disambiguation of acronyms.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Threshold is 0, 1 and 2 for acronyms with two letters, three letters and greater than three letters respectively.

  2. 2.

    https://www.geodatasource.com/world-cities-database/free.

  3. 3.

    http://www.acronymfinder.com/Index--.html.

  4. 4.

    Narendra Modi, Smriti Irani, MS Dhoni, Arun Jaitely, Rahul Gandhi, Arvind Kejriwal and Ravishankar Prasad.

References

  1. Silva, G., Montgomery, C.A.: Knowledge representation for automated understanding of natural language discourse. Comput. Humanit. 11(4), 223–243 (1977)

    Article  Google Scholar 

  2. Lavi, O., Auerbach, G., Persky, E.: Dynamic natural language understanding. US Patent 7,840,400, 23 Nov 2010

    Google Scholar 

  3. Feng, S., Xiong, Y., Yao, C., Zheng, L., Liu, W.: Acronym extraction and disambiguation in large-scale organizational web pages. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, ACM (2009)

    Google Scholar 

  4. Li, C., Ji, L., Yan, J.: Acronym disambiguation using word embedding. In: Twenty-Ninth AAAI Conference on Artificial Intelligence (2015)

    Google Scholar 

  5. Zhang, W., Sim, Y.C., Su, J., Tan, C.L.: Entity linking with effective acronym expansion, instance selection and topic modeling. In: Proceedings of Internationl Joint Conference on Artifical Intelligence, IJCAI 2011, AAAI Press (2011)

    Google Scholar 

  6. HaCohen-Kerner, Y., Kass, A., Peretz, A.: Combined one sense disambiguation of abbreviations. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies, ACL (2008)

    Google Scholar 

  7. McInnes, B.T., Pedersen, T., Liu, Y., Pakhomov, S.V., Melton, G.B.: Using second-order vectors in a knowledge-based method for acronym disambiguation. In: Proceedings of the Fifteenth Conference on Computational Natural Language Learning, CoNLL 2011, Association for Computational Linguistics (2011)

    Google Scholar 

  8. Nguyen, D.B., Hoffart, J., Theobald, M., Weikum, G.: Aida-light: high-throughput named-entity disambiguation. In: Linked Data on the Web at WWW (2014)

    Google Scholar 

  9. Ferragina, P., Scaiella, U.: Fast and accurate annotation of short texts with wikipedia pages. Proceedings of arXiv preprint (2010)

    Google Scholar 

  10. Ratinov, L., Roth, D., Downey, D., Anderson, M.: Local and global algorithms for disambiguation to wikipedia. In: Proceedings of ACL (2011)

    Google Scholar 

  11. Barua, J., Patel, D., Agrawal, A.K.: Removing noise content from online news articles. In: Proceedings of the 20th International Conference on Management of Data, COMAD 2014, Computer Society of India (2014)

    Google Scholar 

  12. Taneva, B., Cheng, T., Chakrabarti, K., He, Y.: Mining acronym expansions and their meanings using query click log. In: Proceedings of the 22nd International Conference on World Wide Web, WWW 2013, ACM (2013)

    Google Scholar 

  13. Ehrmann, M., Della Rocca, L., Steinberger, R., Tanev, H.: Acronym recognition and processing in 22 languages. arXiv preprint arXiv:1309.6185 (2013)

  14. Dannélls, D.: Automatic acronym recognition. In: Proceedings of the Eleventh Conference of the European Chapter of the Association for Computational Linguistics. EACL 2006, Association for Computational Linguistics (2006)

    Google Scholar 

  15. Sánchez, D., Isern, D.: Automatic extraction of acronym definitions from the web. Appl. Intell. 34(2), 311–327 (2011)

    Article  Google Scholar 

  16. Nadeau, D., Turney, P.D.: A supervised learning approach to acronym identification. In: Kégl, B., Lee, H.-H. (eds.) Canadian AI 2005. LNCS (LNAI), vol. 3501, pp. 319–329. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  17. Zahariev, M.: Automatic sense disambiguation for acronyms. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2004, ACM (2004)

    Google Scholar 

  18. Choi, D., Kim, P.: Identifying the most appropriate expansion of acronyms used in wikipedia text. Softw. Pract. Experience 45(8), 1073–1086 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jayendra Barua .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Barua, J., Patel, D. (2016). Discovery, Enrichment and Disambiguation of Acronyms. In: Madria, S., Hara, T. (eds) Big Data Analytics and Knowledge Discovery. DaWaK 2016. Lecture Notes in Computer Science(), vol 9829. Springer, Cham. https://doi.org/10.1007/978-3-319-43946-4_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-43946-4_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-43945-7

  • Online ISBN: 978-3-319-43946-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics