Advertisement

Towards the Refinement of the Arabic Soundex

  • Nedjma Djouhra Ousidhoum
  • Nacéra Bensaou
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7934)

Abstract

In this paper, we present phonetic encoding functions that play the role of hash functions in the indexation of an Arabic dictionary. They allow us to answer approximate queries that, given a query word, ask for all the words that are phonetically similar to it. They consider the phonetic features of the standard Arabic language and involve some possible phonetic alterations induced by specific habits in the pronunciation of Arabic.

We propose two functions, the first one is called the ”Algerian Dialect Refinement” and it takes into account phonetic confusions usually known to the Algerian people while speaking Arabic; and the second one is named the ”Speech Therapy Refinement” and it examines some mispronunciations common to children.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
    National Archives: The Soundex Indexing System, http://www.archives.gov/research/census/soundex.html
  3. 3.
    Aqeel, S.U., et al.: On the Development of Name Search Techniques for Arabic. J. Am. Soc. Inf. Sci. Technol. 57(6), 728–739 (2006)CrossRefGoogle Scholar
  4. 4.
    Ben Hamadou, A.: Vérification et correction automatiques par analyse affixale des textes écrits en langage naturel: le cas de l’arabe non voyellé. PhD thesis, University of Sciences, Technology and Medicine of Tunis (2003)Google Scholar
  5. 5.
    Al Husseiny, A.: Dirassat Qur’aniya-2- Ahkam At-Tajweed Bee Riwayet Arsh An Nafia An Tariq Al’azraq. Maktabat Arradwan (2005)Google Scholar
  6. 6.
    Hall, P.A.V., Dowling, G.R.: Approximate String Matching. Computing Surveys 12(4) (1980)Google Scholar
  7. 7.
    Lait, A., Randell, B.: An Assessment of Name Matching Algorithms. Technical Report, University of Newcastle upon Tyne (1993)Google Scholar
  8. 8.
    Navarro, G.: A Guided Tour to Approximate String Matching. ACM Comput. Surv. 33(1), 31–88 (2001), doi:10.1145/375360.375365CrossRefGoogle Scholar
  9. 9.
    Navarro, G., Baeza-Yates, R.: Very Fast and Simple Approximate String Matching. Information Processing Letters (1999)Google Scholar
  10. 10.
    Ousidhoum, N.D., Bensalah, A., Bensaou, N.: A New Classical Arabic Soundex algorithm. In: Proceedings of the Second Conference on Advances in Communication and Information Technologies (2012), http://doi.searchdl.org/03.CSS.2012.3.28
  11. 11.
    Philips, L.: Hanging on the Metaphone. Computer Language 7(12) (December 1990)Google Scholar
  12. 12.
    Philips, L.: The Double Metaphone Search Algorithm. Dr Dobb’s (2003)Google Scholar
  13. 13.
    Precision Indexing Staff: The Daitch-Mokoto Soundex Reference Guide. Heritage Quest (1994)Google Scholar
  14. 14.
    Rytting, C.A., et al.: Error Correction for Arabic Dictionary Lookup. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation, LREC 2010 (2010)Google Scholar
  15. 15.
    Shaalan, K., Allam, A., Gomah, A.: Towards Automatic Spell Checking for Arabic. In: Proceedings of the Fourth Conference on Language Engineering, Egyptian Society of Language Engineering, ELSE (2003)Google Scholar
  16. 16.
    Shaalan, K., et al.: Arabic Word Generation and Modelling for Spell Checking. In: Proceedings of the Eight International Conference on Language Resources and Evaluation, LREC 2012 (2012)Google Scholar
  17. 17.
    Shaalan, K., Aref, R., Fahmy, A.: An Approach for Analyzing and Correcting Spelling Errors for Non-native Arabic learners. In: Proceedings of the 7th International Conference on Informatics and Systems, INFOS 2010. Cairo University (2010)Google Scholar
  18. 18.
    Taft, R.L.: Name Searching Techniques. Technical Report, New York State Identification and Intelligence System, Albany, N.Y. (1970)Google Scholar
  19. 19.
    Yahia, M.E., Saeed, M.E., Salih, A.M.: An Intelligent Algorithm For Arabic Soundex Function Using Intuitionistic Fuzzy Logic. In: International IEEE Conference on Intelligent Systems, IS (2006)Google Scholar
  20. 20.
    Watson, J.C.E.: The Phonology and Morphology of Arabic. OUP Oxford (2007)Google Scholar
  21. 21.
    Ben Othmane Zribi, C., Ben Ahmed, M.: Efficient Automatic Correction of Misspelled Arabic Words Based on Contextual Information. In: Palade, V., Howlett, R.J., Jain, L. (eds.) KES 2003. LNCS, vol. 2773, pp. 770–777. Springer, Heidelberg (2003)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Nedjma Djouhra Ousidhoum
    • 1
  • Nacéra Bensaou
    • 1
  1. 1.University of Sciences and technology Houari BoumedienneAlgiersAlgeria

Personalised recommendations