A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields

  • Conference paper
Advances in Natural Language Processing (GoTAL 2008)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5221)

Abstract

Applications of statistical Arabic NLP in general, and text mining in particular, along with the tools underlying them, perform much better when the statistical processing operates on deeper language factorizations rather than on raw text. Lexical semantic factorization is very important in this regard because of its feasibility, its high level of abstraction, and the language independence of its output.

At the core of such a factorization lies an Arabic lexical semantic database. While building this language resource (LR), we had to go beyond the conventional approach of collecting words exclusively from dictionaries and thesauri, which alone cannot produce satisfactory coverage of this highly inflective and derivative language.

This paper is therefore devoted to the design and implementation of an Arabic lexical semantics LR that enables retrieval of the possible senses of any given Arabic word with high coverage.

Instead of tying full Arabic words to their possible senses, our LR flexibly relates Arabic lexical compounds, constrained by morphology and PoS tags, to a predefined, limited set of semantic fields across which the standard semantic relations are defined. With the aid of the same large-scale Arabic morphological analyzer and PoS tagger at runtime, the possible senses of virtually any given Arabic word are retrievable.
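The lookup described above can be illustrated with a minimal sketch. It is not the authors' implementation: the `LexicalCompound` key, the `analyze` stub standing in for the morphological analyzer and PoS tagger, the transliterated entries, and the semantic-field labels are all hypothetical and chosen only to show how an inflected surface word could reach its senses through a morphologically constrained entry rather than through a full-word listing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LexicalCompound:
    """Hypothetical LR key: a lexical unit constrained by morphology and PoS."""
    root: str      # transliterated root, e.g. "ktb"
    pattern: str   # morphological pattern label (illustrative)
    pos: str       # PoS tag (illustrative tag names)


# Toy LR: each compound maps to a set of semantic-field labels.
# Every entry and label here is invented for illustration.
TOY_LR = {
    LexicalCompound("ktb", "fa3al", "VERB"): {"writing"},
    LexicalCompound("ktb", "fi3aal", "NOUN"): {"writing", "publications"},
}


def analyze(word: str) -> list[LexicalCompound]:
    """Stand-in for the morphological analyzer and PoS tagger.

    A real analyzer would strip affixes and clitics; this stub just
    looks up a few hand-written transliterated analyses.
    """
    toy_analyses = {
        "kataba": [LexicalCompound("ktb", "fa3al", "VERB")],
        "kitaab": [LexicalCompound("ktb", "fi3aal", "NOUN")],
        # Inflected form with a conjunction prefix and a pronoun suffix.
        "wakitaabuhum": [LexicalCompound("ktb", "fi3aal", "NOUN")],
    }
    return toy_analyses.get(word, [])


def possible_senses(word: str) -> set[str]:
    """Union of the semantic fields of every analysis of the word."""
    fields: set[str] = set()
    for compound in analyze(word):
        fields |= TOY_LR.get(compound, set())
    return fields
```

In this sketch the inflected form `wakitaabuhum` needs no entry of its own in `TOY_LR`; it resolves through the same compound as `kitaab`, which is the coverage argument the abstract makes for keying the resource on analyzed compounds instead of surface words.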

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Attia, M., Rashwan, M., Ragheb, A., Al-Badrashiny, M., Al-Basoumy, H., Abdou, S. (2008). A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields. In: Nordström, B., Ranta, A. (eds) Advances in Natural Language Processing. GoTAL 2008. Lecture Notes in Computer Science, vol 5221. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85287-2_7

  • DOI: https://doi.org/10.1007/978-3-540-85287-2_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85286-5

  • Online ISBN: 978-3-540-85287-2

  • eBook Packages: Computer Science, Computer Science (R0)
