Interactive Voice Application-Based Amazigh Speech Recognition

Hamidi, Mohamed; Satori, Hassan; Zealouk, Ouissam; Satori, Khalid

doi:10.1007/978-981-15-0947-6_26

Mohamed Hamidi¹⁷,
Hassan Satori¹⁷,
Ouissam Zealouk¹⁷ &
…
Khalid Satori¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1076))

1446 Accesses
5 Citations

Abstract

This paper aims to build an interactive speaker-independent automatic Amazigh speech recognition system. The proposed system offers a methodology to extract data remotely from a distance database using the combined interactive voice response (IVR) and automatic speech recognition (ASR) technologies. We describe our experience to design an interactive speech system based on hidden Markov models (HMMs), Gaussian mixture models (GMMs) and Mel frequency spectral coefficients (MFCCs) based on ten first Amazigh digits and six Amazigh words. The best-obtained performance is 89.64% by using 3 HMMs and 16 GMMs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Asterisk IVR. http://www.asterisk.org. Accessed Jan 2015
Shah, K., Ghrera, S.P., Thaker, A.: A novel approach for security issues in VoIP networks in virtualization with IVR. arXiv preprint arXiv:1206.1748 (2012)
Anwar, Z., Yurcik, W., Johnson, R.E., Hafiz, M., Campbell, R.H.: Multiple design patterns for voice over IP (VoIP) security. In: 25th IEEE International Performance, Computing, and Communications Conference, IPCCC 2006, pp. 8-pp (2006, April)
Google Scholar
Rafique, M.Z., Akbar, M.A., Farooq, M.: Evaluating DoS attacks against SIP-based VoIP systems. In: Global Telecommunications Conference, GLOBECOM 2009, pp. 1–6. IEEE (2009, November)
Google Scholar
Basu, J., Bepari, M.S., Roy, R., Khan, S.: Real time challenges to handle the telephonic speech recognition system. In: Proceedings of the Fourth International Conference on Signal and Image Processing 2012 (ICSIP 2012), pp. 395–408. Springer, India (2013)
Google Scholar
Aust, H., Oerder, M., Seide, F., Steinbiss, V.: The Philips automatic train timetable information system. Speech Commun. 17(3), 249–262 (1995)
Google Scholar
Bhat, C., Mithun, B.S., Saxena, V., Kulkarni, V., Kopparapu, S.: Deploying usable speech enabled IVR systems for mass use. International Conference on Human Computer Interactions (ICHCI), pp. 1–5 (2013)
Google Scholar
Satori, H., ElHaoussi, F.: Investigation Amazigh speech recognition using CMU tools. Int. J. Speech Technol. 17(3), 235–243 (2014)
Article Google Scholar
Hamidi, M., Satori, H., Satori, K.: Implementing a voice interface in VOIP network with IVR server using Amazigh digits. Int. J. Multi-disciplinary Sci. 2(2), 38–43 (2016)
Google Scholar
Madsen, L., Van Meggelen, J., Bryant, R.: Asterisk: The Definitive Guide. O’Reilly Media, Inc., pp. 121–145, 737–745, 417–478 (2011)
Google Scholar
Penton, J., Terzoli, A.: Asterisk: A converged TDM and packet-based communications system. In: Proceedings of SATNAC 2003-Next Generation Networks (2003)
Google Scholar
Handley, M., Schulzrinne, H., Schooler, E., et al.: RFC 2543. SIP: Session Initiation Protocol (1999)
Google Scholar
Oracle VM VirtualBox. https://www.virtualbox.org/. Accessed Jan 2015
Huang, X., Acero, A., Hon, H.W., Foreword By-Reddy, R.: Spoken language processing: a guide to theory, algorithm, and system development. Prentice Hall PTR (2001)
Google Scholar
Satori, H., Zealouk, O., Satori, K., ElHaoussi, F.: Voice comparison between smokers and non-smokers using HMM speech recognition system. Int. J. Speech Technol. 20(4), 771–777 (2017)
Article Google Scholar
Zealouk, O., Satori, H., Hamidi, M., Satori, K.: Speech recognition for Moroccan dialects: feature extraction and classification methods. J. Adv. Res. Dyn. Control Syst. 11(2), 1401–1408 (2019)
Google Scholar
Hamidi, M., Satori, H., Zealouk, O., Satori, K.: Speech coding effect on Amazigh alphabet speech recognition performance. J. Adv. Res. Dyn. Control Syst. 11(2), 1392–1400 (2019)
Google Scholar
Zealouk, O., Satori, H., Hamidi, M., Laaidi, N., Satori, K.: Vocal parameters analysis of smoker using Amazigh language. Int. J. Speech Technol. 21(1), 85–91 (2018)
Article Google Scholar
Zealouk, O., Satori, H., Hamidi, M., Satori, K.: Voice pathology assessment based on automatic speech recognition using Amazigh digits. In: Proceedings of the 2nd International Conference on Smart Digital Environment, pp. 100–105. ACM (2018)
Google Scholar
Mohamed, H., Hassan, S., Ouissam, Z., Khalid, S., Naouar, L.: Interactive voice response server voice network administration using hidden Markov model speech recognition system. In: 2018 Second World Conference on Smart Trends in Systems, Security and Sustainability (WorldS4- IEEE), pp. 16–21 (2018, October)
Google Scholar
Boukous, A.: Société, langues et cultures au Maroc: Enjeux symboliques (No. 8). Faculté des lettres et des sciences humans-Rabat (1995)
Google Scholar
Beal, M.J., Ghahramani, Z., Rasmussen, C.E.: The infinite hidden Markov model. In: Advances in Neural Information Processing Systems, pp. 577–584 (2002)
Google Scholar
Shanmugham, S., Burnett, D.: Media Resource Control Protocol Version 2 (MRCPv2) (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

LIIAN Laboratory, FSDM, USMBA, Fez, Morocco
Mohamed Hamidi, Hassan Satori, Ouissam Zealouk & Khalid Satori

Authors

Mohamed Hamidi
View author publications
You can also search for this author in PubMed Google Scholar
Hassan Satori
View author publications
You can also search for this author in PubMed Google Scholar
Ouissam Zealouk
View author publications
You can also search for this author in PubMed Google Scholar
Khalid Satori
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohamed Hamidi .

Editor information

Editors and Affiliations

Department of Electronics and Communication Engineering, Shri Ramswaroop Memorial Group of Professional Colleges (SRMGPC), Lucknow, Uttar Pradesh, India
Vikrant Bhateja
School of Computer Engineering, Kalinga Institute of Industrial Technology (KIIT), Bhubaneswar, Odisha, India
Suresh Chandra Satapathy
Department of Computer Sciences, Faculty of Sciences Dhar Mahraz, Sidi Mohammed Ben Abbdallah University, Fez, Morocco
Hassan Satori

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hamidi, M., Satori, H., Zealouk, O., Satori, K. (2020). Interactive Voice Application-Based Amazigh Speech Recognition. In: Bhateja, V., Satapathy, S., Satori, H. (eds) Embedded Systems and Artificial Intelligence. Advances in Intelligent Systems and Computing, vol 1076. Springer, Singapore. https://doi.org/10.1007/978-981-15-0947-6_26

Download citation

DOI: https://doi.org/10.1007/978-981-15-0947-6_26
Published: 08 April 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-0946-9
Online ISBN: 978-981-15-0947-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics