Skip to main content

Hidden Markov Model for Speech Recognition System—A Pilot Study and a Naive Approach for Speech-To-Text Model

  • Conference paper
  • First Online:
Speech and Language Processing for Human-Machine Communications

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 664))

Abstract

Today’s advancement in the research field has brought a new horizon to design the state-of-the-art systems that produce sound utterance. In order to attain a higher level of speech understanding potentiality, it is of utmost importance to achieve good efficiency. Speech-to-Text (STT) or voice recognition system is an efficacious approach that aims at recognizing speech and allows the conversion of the human voice into the text. By this, an interface between the human and the computer is created. In this direction, this paper introduces a novel approach to convert STT by using Hidden Markov Model (HMM). HMM along with other techniques such as Mel-Frequency Cepstral Coefficients (MFCCs), Decision trees, Support Vector Machine (SVM) is used to ascertain the speakers’ utterances and catalyse these utterances into quantization features by evaluating the likelihood extremity of the spoken word. The accuracy of the proposed architecture is studied, which is found to be better than the existing methodologies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alias, F., et al.: Towards high-quality next generation text-to-speech synthesis: a multi domain approach by automatic domain classification. IEEE Trans. Audio Speech Lang. Process. 16(7) (Sept 2008)

    Google Scholar 

  2. Abushariah, A.A.M., et al.: English digits speech recognition system based on Hidden Markov Models. In: IEEE Conference 2010, ICCCE. doi:10.1109/ICCCE.2010.5556819

  3. Hossan, M.A., et al.: A novel approach for MFCC feature extraction. In: IEEE Conference 2010, ICSPCS. doi:10.1109/ICSPCS.2010.5709752

  4. Bsyrne, W.: Minimum Bayes risk estimation and decoding in large vocabulary continuous speech recognition. IEEE E89-D(3), 900–907 (2006)

    Google Scholar 

  5. Patel, I., et al.: Speech recognition using hidden Markov model with MFCC-subband technique. In: IEEE Conference (2010). doi:10.1109/ITC.2010.45

  6. Duan, W., et al.: Weighted naive Bayesian classifier model based on information gain. In: IEEE Conference, (ISDEA). doi:10.1109/ISDEA.2010.226

  7. Gales, M., et al.: The application of hidden Markov models in speech recognition. Found. Trends Signal Process. 1(3), 195–304 (2008)

    Google Scholar 

  8. Swamy, S., et al.: An efficient speech recognition system. Comput. Sci. Eng. Int. J. (CSEIJ) 3(4) (Aug 2013)

    Google Scholar 

  9. Kholghi, M., et al.: Classification and evaluation of data mining techniques for data stream requirements. In: IEEE Conference on Computer Communication Control and Automation (3CA). doi:10.1109/3CA.2010.5533759

  10. Shahrokhi, N., et al.: Targeting customers with data mining techniques: classification. In: 2011 International Conference on User Science and Engineering (i-USEr), IEEE, New York. doi:10.1109/iUSEr.2011.6150567

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Rashmi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rashmi, S., Hanumanthappa, M., Reddy, M.V. (2018). Hidden Markov Model for Speech Recognition System—A Pilot Study and a Naive Approach for Speech-To-Text Model. In: Agrawal, S., Devi, A., Wason, R., Bansal, P. (eds) Speech and Language Processing for Human-Machine Communications. Advances in Intelligent Systems and Computing, vol 664. Springer, Singapore. https://doi.org/10.1007/978-981-10-6626-9_9

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6626-9_9

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6625-2

  • Online ISBN: 978-981-10-6626-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics