Skip to main content

A Novel Approach to Identify Factor Posing Pronunciation Disorders

  • Conference paper
  • First Online:
Computational Collective Intelligence (ICCCI 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9875))

Included in the following conference series:

Abstract

Literature seems rich with approaches which are based on the features contained in the speech signal and natural language processing techniques to detect vocal pathologies in human speeches. From the literature, we can mention also that several factors (vocal pathology, non-native speaker, psychological state, age …) can pose pronunciation disorders [10]. But to our knowledge, no work has treated pathological speech to identify factor posing pronunciation disorders. The current work consists in introducing an original approach based on the forced alignment score [8] to identify the factor posing mispronunciations contained in the Arabic speech. We distinguish two main factors: the pronunciation disorders can be from native speakers with vocal pathology or from non-native speakers who do not master Arabic-phoneme pronunciation. The results are encouraging; we attain an identification rate of 95 %. Biologists and computer scientists can benefit from our proposed approach to design high performance systems of vocal pathology diagnostic.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Terbeh, N., Maraoui, M., Zrigui, M.: Probabilistic approach for detection of vocal pathologies in the Arabic speech. In: Gelbukh, A. (ed.). LNCS, vol. 9042, pp. 606–616. Springer, Heidelberg (2015)

    Google Scholar 

  2. Alghamdi, M., Almuhtasib, H., Elshafei, M.: Arabic phonological rules. King Saud Univ. J. Comput. Sci. Inf. 16, 1–25 (2004)

    Google Scholar 

  3. Terbeh, N., Labidi, M., Zrigui, M.: Automatic speech correction: a step to speech recognition for people with disabilities. In: ICTA 2013, Hammamet-Tunisia, 23–26 October 2013 (2013)

    Google Scholar 

  4. Terbeh, N., Zrigui, M.: Vers la Correction Automatique de la Parole Arabe. In: Citala 2014, Oujda-Morocco, 26–27 November 2014 (2014)

    Google Scholar 

  5. Patane, G., Russo, M.: The enhanced LBG algorithm. Neural Netw. 14(9), 1219–1237 (2001)

    Article  Google Scholar 

  6. Bréhilin, L., Gascuel, O.: Modèles de Markov caches et apprentissage de sequences

    Google Scholar 

  7. Majidnezhad, V., Kheidorov, I.: An ANN-based method for detecting vocal fold pathology. Int. J. Comput. Appl. 62(7), 1–4 (2013)

    Google Scholar 

  8. Jurafsky, D., Ward, W., Zhang, B., Herold, K., Yu, X., Zhang, S.: What kind of pronunciation variation is hard for triphones to model? In: ICASSP 2001, Salt Lake City, UT, 7–11 May 2001

    Google Scholar 

  9. Majidnezhad, V., Kheidorov, I.: A HMM-based method for vocal fold pathology diagnosis. IJCSI Int. J. Comput. Sci. Issues 9(6), 135–138 (2012). No. 2

    Google Scholar 

  10. Kim, J., Kumar, N., Tsiartas, A., Li, M., Narayanan, S.: Intelligibility classification of pathological speech using fusion of multiple subsystems. In: Proceedings of Interspeech, Portland, Oregon, USA, pp. 534–537 (2012)

    Google Scholar 

  11. Paquet, P.: L’utilisation des réseaux de neurones artificiels en finance. Document de recherche n° 1997-1 (1997)

    Google Scholar 

  12. Archaux, C., Laanaya, H., Martin, A., Khenchaf, A.: An SVM based churn detector in prepaid mobile telephony (2004)

    Google Scholar 

  13. Kukharchik, P., Martynov, D., Kheidorov, I., Kotov, O.: Vocal fold pathology detection using modified wavelet-like features and support vector machines. In: 15th European Signal Processing Conference (EUSIPCO 2007), Poznan, Poland, 3–7 September 2007

    Google Scholar 

  14. Damerval, C.: Ondelettes pour la détection de caractéristiques en traitement d’images. Doctoral thesis, Mai 2008

    Google Scholar 

  15. Plante, F., Christian, B.-V.: Détection acoustique des pathologies phonatoires chez l’enfant. Doctoral thesis (1993)

    Google Scholar 

  16. Terbeh, N., Zrigui, M.: Vocal pathologies detection and mispronounced phonemes identification: case of Arabic continuous speech. In: LREC 2016, Portorož-Slovenia, 23–28 May 2016 (2016)

    Google Scholar 

  17. http://www.un.org/french/disabilities/default.asp?navid=35&pid=833, [consulted 6 April 2016]

  18. http://kenanaonline.com/users/dkkhaledelnagar/photos/1238136361, [consulted 24 April 2016]

  19. Blanc-Brude, T.: Intégration de commandes vocales dans un environnement d’apprentissage par l’action: enjeux ergonomiques. Doctoral dissertation, Grenoble 1 (2004)

    Google Scholar 

  20. Biadsy, F., Hirschberg, J., Habash, N.: Spoken Arabic dialect identification using phonotactic modeling. In: Proceedings of the EACL 2009 Workshop on Computational Approaches to Semitic Languages, pp. 53–61. Association for Computational Linguistics (2009)

    Google Scholar 

Download references

Acknowledgments

We would like to benefit from this opportunity to express my deepest regards to all members of the evaluation research committee in the ICCCI scientific conference. We would like also to extend our advance thanks to Mr. Mounir ZRIGUI for his valuable advices and encouragement.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Naim Terbeh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Terbeh, N., Zrigui, M. (2016). A Novel Approach to Identify Factor Posing Pronunciation Disorders. In: Nguyen, NT., Iliadis, L., Manolopoulos, Y., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2016. Lecture Notes in Computer Science(), vol 9875. Springer, Cham. https://doi.org/10.1007/978-3-319-45243-2_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-45243-2_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-45242-5

  • Online ISBN: 978-3-319-45243-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics