An Application of Pattern Matching Stemmer in Arabic Dialogue System

  • Conference paper
Agent and Multi-Agent Systems: Technologies and Applications (KES-AMSTA 2011)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6682)

Abstract

This paper proposes a stemmer for the Arabic language, based largely on pattern matching and pattern strength techniques. Stemmers are algorithms that extract the root of a word by removing its affixes. Stemming has been applied in a large number of applications, such as indexing, information retrieval systems, and web search engines. This paper also proposes applying stemming as a pre-processing stage in a dialogue system (DS). The proposed stemmer was compared with three other well-known stemmers and achieved favourable accuracy.
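To make the idea of affix removal followed by pattern matching concrete, here is a minimal Python sketch. It is not the paper's implementation: the affix lists, the pattern list, and the function names (`strip_affixes`, `match_pattern`, `stem`) are hypothetical samples chosen for illustration, and the pattern strength technique is left out because the abstract does not define it. Patterns follow the usual convention in Arabic morphology of writing ف, ع, ل as placeholders for the three root letters.

```python
# Illustrative sketch only: these small lists are hypothetical samples,
# not the resources used by the paper's stemmer.
PREFIXES = ["ال", "و"]            # definite article, conjunction
SUFFIXES = ["ات", "ون", "ة"]      # common plural / feminine endings
PATTERNS = ["فاعل", "مفعول", "فعال", "فعل"]
PLACEHOLDERS = "فعل"              # letters that stand for root positions


def strip_affixes(word):
    """Remove at most one known prefix and one known suffix,
    keeping at least three letters."""
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= 3:
            word = word[len(prefix):]
            break
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            word = word[:-len(suffix)]
            break
    return word


def match_pattern(word, pattern):
    """Return the root letters if the word fits the pattern, else None."""
    if len(word) != len(pattern):
        return None
    root = []
    for word_char, pattern_char in zip(word, pattern):
        if pattern_char in PLACEHOLDERS:
            root.append(word_char)      # root position: keep the word's letter
        elif word_char != pattern_char:
            return None                 # fixed pattern letter does not match
    return "".join(root)


def stem(word):
    """Strip affixes, then try each pattern in order; fall back to the
    stripped word when no pattern fits."""
    stripped = strip_affixes(word)
    for pattern in PATTERNS:
        root = match_pattern(stripped, pattern)
        if root is not None:
            return root
    return stripped
```

With these sample lists, `stem("الكاتب")` and `stem("مكتوب")` both return `"كتب"`: the first strips the definite article and matches the pattern فاعل, the second matches مفعول directly. A real stemmer would need far larger affix and pattern inventories and a way to rank competing matches.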




Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hijjawi, M., Bandar, Z., Crockett, K., Mclean, D. (2011). An Application of Pattern Matching Stemmer in Arabic Dialogue System. In: O’Shea, J., Nguyen, N.T., Crockett, K., Howlett, R.J., Jain, L.C. (eds) Agent and Multi-Agent Systems: Technologies and Applications. KES-AMSTA 2011. Lecture Notes in Computer Science, vol 6682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22000-5_5


  • DOI: https://doi.org/10.1007/978-3-642-22000-5_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-21999-3

  • Online ISBN: 978-3-642-22000-5

  • eBook Packages: Computer Science, Computer Science (R0)
