Hardware Implementation of MFCC Feature Extraction for Speech Recognition on FPGA

Dao, Van-Lan; Nguyen, Van-Danh; Nguyen, Hai-Duong; Hoang, Van-Phuc

doi:10.1007/978-3-319-49073-1_27

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 538)

Included in the following conference series:

International Conference on Advances in Information and Communication Technology

Abstract

This paper presents an FPGA-based Mel Frequency Cepstral Coefficient (MFCC) IP core for speech recognition. FPGA implementation results show that the proposed MFCC core uses hardware resources more efficiently than other designs.

Author information

Authors and Affiliations

Faculty of Radio-Electronics, Le Quy Don Technical University, Hanoi, Vietnam
Van-Lan Dao, Van-Danh Nguyen, Hai-Duong Nguyen & Van-Phuc Hoang

Corresponding author

Correspondence to Van-Lan Dao.

Editor information

Editors and Affiliations

School of Information Science, Area of Human Life Design, Japan Advanced Institute of Science and Technology, Nomi-shi, Ishikawa, Japan
Masato Akagi
Department of Computer Science, VNU University of Engineering and Technology, Hanoi, Vietnam
Thanh-Thuy Nguyen
Faculty of Information Technology, Thai Nguyen University of Information and Communication Technology, Thai Nguyen, Vietnam
Duc-Thai Vu
Thai Nguyen University of Information and Communication Technology, Thai Nguyen, Vietnam
Trung-Nghia Phung
School of Knowledge Science, Area of Knowledge Management, Japan Advanced Institute of Science and Technology, Nomi-shi, Ishikawa, Japan
Van-Nam Huynh

About this paper

Cite this paper

Dao, VL., Nguyen, VD., Nguyen, HD., Hoang, VP. (2017). Hardware Implementation of MFCC Feature Extraction for Speech Recognition on FPGA. In: Akagi, M., Nguyen, TT., Vu, DT., Phung, TN., Huynh, VN. (eds) Advances in Information and Communication Technology. ICTA 2016. Advances in Intelligent Systems and Computing, vol 538. Springer, Cham. https://doi.org/10.1007/978-3-319-49073-1_27

DOI: https://doi.org/10.1007/978-3-319-49073-1_27
Published: 12 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-49072-4
Online ISBN: 978-3-319-49073-1
eBook Packages: Engineering, Engineering (R0)
