Skip to main content

The State of the Art of Feature Extraction Techniques in Speech Recognition

  • Conference paper
  • First Online:
Speech and Language Processing for Human-Machine Communications

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 664))

Abstract

This paper surveys feature extraction techniques applied in automatic speech recognition. After so many researches and improvement, the accuracy is a key issue in speech recognition systems. Speech recognition process converts the speech signal into its corresponding written text by the computer system. In this paper, we brief few well-known techniques of feature extraction like LPC, MFCC, RASTA, PCA, LDA, PLP.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bhabad, S.S., Kharate, G.K.: An overview of technical progress in speech recognition. Int. J. Adv. Res. Comput. Sci. Soft. Eng. 3(3) (2013)

    Google Scholar 

  2. Nehel, N.S., Holambe, R.S.: DWT and LPC based feature extraction methods for isolated word recognition. J. Audio Speech Music Process (2012)

    Google Scholar 

  3. Mishra A.N., Shrotriya, M.C., Sharan, S.N.: Comparative wavelet, PLP and LPC speech recognition techniques on the Hindi speech digits database. ICDIP Singapore (2010)

    Google Scholar 

  4. Zhang, G., Song Q., Fei, S.: Research on speech emotion. Comput. Technol. Prospect 19, 92–95 (2009) (in Chinese)

    Google Scholar 

  5. Wijoyo, T.S.: Speech recognition using linear predictive coding and artificial neural network for controlling movement of mobile robot. In: International Conference on Information and Electronics Engineering

    Google Scholar 

  6. Tiwari, V.: MFCC and its applications in speaker recognition. Int. J. Emerg. Technol. 1(1), 19–22(2010). National Center for Biotechnology Information, http://www.ncbi.nlm.nih.gov

  7. Anusuya, M.A., Katti, S.K.: Speech recognition by machine: a review. Int. J. Comput. Sci. Inf. Secur. (IJCSIS) 6(3), pp. 181–205 (2009). ACM: ACM Policy and Procedures on Plagiarism, http://www.acm.org/publications/policies/plagiarism_policy

  8. Yadav, S.K., Mukhedkar, M.M.: Review on speech recognition. Int. J. Sci. Eng. 1(2), 61–70 (2013)

    Google Scholar 

  9. Luengo, I., Navas, E.: Feature analysis and evaluation for automatic emotion identification in speech. IEEE Trans. Multimedia 12(6), 267–270 (2010)

    Google Scholar 

  10. Wiqas, G., Singh, N.: Literature review on automatic speech recognition. Int. J. Comput. Appl. 41(8) (2012) (0975 – 8887)

    Google Scholar 

  11. Dave, N.: Feature extraction methods LPC, PLP and MFCC in speech recognition. Int. J. Adv. Res. Eng. Technol. 1(VI) (2013)

    Google Scholar 

  12. Hinton, G., Deng, L., Yu, D., Dahl, G., Mohamed, A., Jaitly, N., Senior, A., Vanhoucke, V., Nguyen, P., Sainath, T., Kingsbury, B.: Deep neural networks for acoustic modeling in speech recognition. IEEE Sig. Process. Mag. 29(6), 82–97 (2012)

    Article  Google Scholar 

  13. Prabhakar, O.P., Sahu, K.N.: A survey on: voice command recognition technique. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 3(5) (2013)

    Google Scholar 

  14. Muda, L.: Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques. J. Comput. 2(3) (2010)

    Google Scholar 

  15. George, K.K., Arunraj, K., Sreekumar, K.T., Kumar, C.S., Ramachandran, K.I.: Towards improving the performance of text/language independent speaker recognition systems. In: International Conference on Power, Signals, Controls and Computation (EPSCICON), 8–10 January 2014

    Google Scholar 

  16. Hao, T., Chao-Hong, M., Lin-Shan, L.: An initial attempt for phoneme recognition using structured support vector machine (SVM). In: IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), 2010, pp. 4926–4929 (2010)

    Google Scholar 

  17. Gemmeke, J.F., Virtanen, T., Hurmalainen, A.: Exemplar-based sparse representations for noise robust automatic speech recognition. IEEE Trans. Audio Speech Lang. Process. 19(7), 2067–2080 (2011)

    Article  Google Scholar 

  18. Cheng, X., Duan, Q.: Speech emotion recognition using Gaussian mixture model. In: 2nd International Conference on Computer Application and System Modeling, pp. 1222–1225 (2012)

    Google Scholar 

  19. Nidhyananthan, S.S., Kumari, R.S.S.: Text independent voice based students attendance system under noisy environment using RASTA-MFCC feature. In: International Conference on Communication and Network Technologies (ICCNT) (2014)

    Google Scholar 

  20. Tan, T.S., Ariff, A.K., Ting, C.M., Salleh, S.H.: Application of Malay speech technology in Malay speech therapy assistance tools. In: Proceedings of IEEE Conference on Intelligent and Advanced Systems, pp. 330–334 (2007)

    Google Scholar 

  21. Venkateswarlu, R.L.K., Kumari, R.V.: Novel approach for speech recognition by using self organised maps. In: 2011 International Conference on Emerging Trends in Networks and Computer Communications (ETNCC), Udaipur, pp. 215–222 (2011)

    Google Scholar 

  22. Cutajar, M., Gatt, E., Grech, I., Casha, O., Micallef, J.: Comparative study of automatic speech recognition techniques. IET Sig. Process. 7(1), 25–46 (2013)

    Google Scholar 

  23. Ravikumar, K.M., Rajagopal, R., Nagaraj, H.C.: An approach for objective assessment of stuttered speech using MFCC features. ICGST Int. J. Digit. Sig. Process. 9, 19–24 (2009)

    Google Scholar 

  24. Seehapoch, T., Wongthanavasu, S.: Speech emotion recognition using support vector machines. In: 5th IEEE International Conference on Knowledge and Smart Technology (KST), pp. 86–91, Jan 2013

    Google Scholar 

  25. Abu Shariah, M.A.M., Ainon, R.N., Zainuddin, R., Khalifa, O.O.: Human computer interaction using isolated-words speech recognition technology. In: International Conference on Intelligent and Advanced Systems, ICIAS 2007, pp. 1173–1178 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Poonam Bansal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gupta, D., Bansal, P., Choudhary, K. (2018). The State of the Art of Feature Extraction Techniques in Speech Recognition. In: Agrawal, S., Devi, A., Wason, R., Bansal, P. (eds) Speech and Language Processing for Human-Machine Communications. Advances in Intelligent Systems and Computing, vol 664. Springer, Singapore. https://doi.org/10.1007/978-981-10-6626-9_22

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6626-9_22

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6625-2

  • Online ISBN: 978-981-10-6626-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics