Skip to main content

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 179))

Abstract

Processing of Arabic language is eminent for the fact that currently the number of computer and Internet users in the Arab word is growing tremendously. The problem of stemming is very important in information retrieval, knowledge mining and language processing. Arabic has very complex morphology and stemming rules that must deal with many specific properties of Arabic. This paper describes very simple rules for stemming of Arabic words. Two of these rules are universal, i.e. they are applicable to any word category, and one rule for each of the four categories: nouns, verbs, adverbs and adjectives. The rules were more successful in case of adverbs. As for nouns, verbs and adjectives, some errors occurred especially in case of suffix processing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Encyclopaedia Britannica Online. Alphabet. http://www.britannica.com/EBchecked/topic/17212/alphabet (2011)

  2. Buckwalter, T.: In: Ide, N., Veronis, J., Soudi, A., Bosch, A.v.d., Neumann, G. (eds.) Arabic Computational Morphology, Text, Speech and Language Technology, vol. 38, pp. 23–41. Springer, The Netherlands (2007). http://dx.doi.org/10.1007/978-1-4020-6046-53

  3. Habash, N.Y.: Synthesis lectures on human language technologies. 3(1), 1 (2010). 10.2200/S00277ED1V01Y201008HLT010. http://www.morganclaypool.com/doi/abs/10.2200/S00277ED1V01Y201008HLT010

  4. Gillies, A., Erl, E., Trenkle, J., Schlosser, S.: In: Proceedings of the Symposium on Document Image Understanding Technology (1999)

    Google Scholar 

  5. Trenkle, J., Gilles, A., Eriandson, E., Schlosser, S., Cavin, S.: In: Symposium on Document Image Understanding Technology, pp. 159–168 (2001)

    Google Scholar 

  6. Maamouri, M., Bies, A., Kulick, S.: In: Proceedings of the British Computer Society Arabic NLP/MT Conference (2006)

    Google Scholar 

  7. Soori H, Platos J, Snášel V, Abdulla, H.: In: Snášel, V., Platos, J., El-Qawasmeh, E. (eds.) Digital Information Processing and Communications, Communications in Computer and Information Science, vol. 188, pp. 97–105. Springer, Berlin (2011). http://dx.doi.org/10.1007/978-3-642-22389-19

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hussein Soori .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Soori, H., Platoš, J., Snášel, V. (2013). Simple Stemming Rules for Arabic Language. In: Kudělka, M., Pokorný, J., Snášel, V., Abraham, A. (eds) Proceedings of the Third International Conference on Intelligent Human Computer Interaction (IHCI 2011), Prague, Czech Republic, August, 2011. Advances in Intelligent Systems and Computing, vol 179. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31603-6_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31603-6_9

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31602-9

  • Online ISBN: 978-3-642-31603-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics