Skip to main content

Hardware Implementation of MFCC Feature Extraction for Speech Recognition on FPGA

  • Conference paper
  • First Online:
Advances in Information and Communication Technology (ICTA 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 538))

Abstract

In this paper, an FPGA-based Mel Frequency Cepstral Coefficient (MFCC) IP core for speech recognition is presented. The implementation results on FPGA show that the proposed MFCC core achieves higher resource usage efficiency compared with other designs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Boves, L., den Os, E.: Speaker recognition in telecom applications. In: Proceedings of the Interactive Voice Technology for Telecommunications Applications, Torino, pp. 203–208 (1998)

    Google Scholar 

  2. McLoughlin, I.V., Sharifzadeh, H.R.: Speech recognition engine adaptions for smart home dialogues. In: Proceedings of the International Conference on Information, Communications and Signal Processing, Singapore, pp. 1–5 (2007)

    Google Scholar 

  3. Marchetto, E., Avanzini, F., Flego, F.: An automatic speaker recognition system for intelligence applications. In: Proceedings of the European Signal Processing, Glasgow, pp. 1612–1616 (2009)

    Google Scholar 

  4. Selvan, K., Joseph, A., Anish Babu, K.K.: Speaker recognition system for security applications. In: IEEE Recent Advances in Intelligent Computational Systems (RAICS), pp. 26–30 (2013)

    Google Scholar 

  5. Ajgou, R., Sbaa, S., Ghendir, S., Chamsa, A., Taleb-Ahmed, A.: Robust remote speaker recognition system based on AR-MFCC features and efficient speech activity detection algorithm. In: International Symposium on Wireless Communications Systems (ISWCS), Barcelona, pp. 722–727 (2014)

    Google Scholar 

  6. Malode, A.A., Sahare, S.L.: An improved speaker recognition by using VQ and HMM. In: Proceedings of the International on Sustainable Energy and Intelligent Systems (SEISCON 2012), Tiruchengode, pp. 1–7 (2012)

    Google Scholar 

  7. Lung, V.D., Truong, V.N.: Vietnamese speech recognition using dynamic time warping and coefficient of correlation. In: Proceedings of the International Conference on Control, Automation and Information Sciences (ICCAIS), Nha Trang, pp. 64–67 (2013)

    Google Scholar 

  8. Tuzun, O.B., Demirekler, M., Nakiboglu, K.B.: Comparison of parametric and non-parametric representations of speech for recognition. In: Proceedings, pp. 65–68 (1994)

    Google Scholar 

  9. Openshaw, J.P., Sun, Z.P., Mason, J.S.: A comparison of composite features under degraded speech in speaker recognition. In: Proceedings on Acoustics, Speech, and Signal Processing, vol. 2, Minneapolis, USA, pp. 371–374 (1993)

    Google Scholar 

  10. Vergin, R., O’Shaughnessy, D., Gupta, V.: Compensated mel frequency cepstrum coefficients. In: Proceedings on Acoustics, Speech, and Signal Processing, Minneapolis, USA, pp. 323–326 (1996)

    Google Scholar 

  11. Ibrahim, N.J., et al.: Quranic verse recitation feature extraction using Mel-frequency cepstral coefficients (MFCC). In: Proceedings of the International Colloquium on Signal Processing and Its Applications (CSPA), Kuala Lumpur, Malaysia (2008)

    Google Scholar 

  12. Price, J., Sophomore Student: Design an automatic speech recognition system using maltab. University of Maryland Estern Shore Princess Anne

    Google Scholar 

  13. Wang, J.-C., Wang, J.-F., Weng, Y.-S.: Chip design of mel frequency cepstral coefficients for speech recognition. In: Proceedings of the Advanced IEEE International Conference on Acoustics, Speech, and Signal Processing, Istanbul, vol. 6, pp. 3658–3661 (2000)

    Google Scholar 

  14. Wassi, G., Iloga, S., Romain, O., Granado, B.: FPGA-based real-time MFCC extraction for automatic audio indexing on FM broadcast data. In: Proceedings on Design and Architectures for Signal and Image Processing (DASIP), Krakow, pp. 1–6 (2015)

    Google Scholar 

  15. Bahoura, M., Ezzaidi, H.: Hardware implementation of MFCC feature extraction for respiratory sounds analysis. In: Proceedings of the International Workshop on Systems, Signal Processing and their Applications (WoSSPA), Algiers, pp. 226–229 (2013)

    Google Scholar 

  16. Ehkan, P., Zakaria, F.F., Warip, M.N.M., Sauli, Z., Elshaikh, M.: Hardware implementation of MFCC-based feature extraction for speaker recognition. In: Sulaiman, H.A., Othman, M.A., Othman, M.F.I., Rahim, Y.A., Pee, N.C. (eds.) Advanced Computer and Communication Engineering Technology. LNEE, vol. 315, pp. 471–480. Springer, Heidelberg (2015). doi:10.1007/978-3-319-07674-4_46

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Van-Lan Dao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Dao, VL., Nguyen, VD., Nguyen, HD., Hoang, VP. (2017). Hardware Implementation of MFCC Feature Extraction for Speech Recognition on FPGA. In: Akagi, M., Nguyen, TT., Vu, DT., Phung, TN., Huynh, VN. (eds) Advances in Information and Communication Technology. ICTA 2016. Advances in Intelligent Systems and Computing, vol 538. Springer, Cham. https://doi.org/10.1007/978-3-319-49073-1_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-49073-1_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-49072-4

  • Online ISBN: 978-3-319-49073-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics