HMM-Based Speaker Gender Recognition for Bodo Language

Chakraborty, Chandralika; Talukdar, Pran Hari

doi:10.1007/978-981-10-8911-4_15


Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 31)

Abstract

Speech is the most natural way for humans to exchange information. Its primary content is the message carried by the spoken words, but it also conveys the speaker's emotion, health condition, and gender, as well as the language being spoken. Systems that extract and characterize this speaker-related information from speech signals are called speaker recognition systems. Speaker recognition applications are becoming common and useful as more modern devices are designed for the convenience of the general public, and such systems have been developed for many indigenous languages. Hidden Markov models (HMMs) have been applied to speaker recognition with considerable success and have gained much popularity. This paper presents an attempt to develop a speaker gender recognition system. A model built with the Hidden Markov Model Toolkit (HTK 3.4.1) was trained and tested on sample speech from both genders in the Bodo language, and the results show good recognition.
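
The general idea behind HMM-based gender recognition can be illustrated with a minimal sketch: train one HMM per gender, score an utterance under each model, and pick the label with the higher likelihood. The code below is not the chapter's HTK system. It assumes a toy discrete HMM over two hypothetical symbols (0 for a low-pitch frame, 1 for a high-pitch frame), and every parameter value in `MODELS` is invented for illustration rather than taken from the paper.

```python
import math


def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | model) for a discrete HMM."""
    n_states = len(start)
    # Initialise with the start distribution weighted by the first emission.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    log_prob = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through the transition matrix, then apply the emission.
            alpha = [
                sum(alpha[r] * trans[r][s] for r in range(n_states)) * emit[s][obs[t]]
                for s in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # the sequence is impossible under this model
        log_prob += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_prob


def classify(obs, models):
    """Return the label whose HMM assigns the highest likelihood to obs."""
    return max(models, key=lambda label: log_likelihood(obs, *models[label]))


# Hypothetical models: (start probabilities, transition matrix, emission matrix).
# All values are made up for this sketch.
MODELS = {
    "male": ([0.5, 0.5], [[0.9, 0.1], [0.1, 0.9]], [[0.8, 0.2], [0.6, 0.4]]),
    "female": ([0.5, 0.5], [[0.9, 0.1], [0.1, 0.9]], [[0.2, 0.8], [0.4, 0.6]]),
}

if __name__ == "__main__":
    print(classify([0, 0, 0, 1, 0], MODELS))
    print(classify([1, 1, 1, 0, 1], MODELS))
```

In a real system the observations would be acoustic feature vectors extracted from the speech signal, and the model parameters would be estimated from training data rather than written by hand.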

Author information

Authors and Affiliations

Department of Information Technology, Sikkim Manipal Institute of Technology, Sikkim Manipal University, Majitar, Sikkim, India
Chandralika Chakraborty
Kaziranga University, Koraikhuwa, Assam, India
Pran Hari Talukdar

Corresponding author

Correspondence to Chandralika Chakraborty.

Editor information

Editors and Affiliations

Department of Information Technology, Sikkim Manipal University, Gangtok, Sikkim, India
Hiren Kumar Deva Sarma
Department of Information Technology, Sikkim Manipal University, Gangtok, Sikkim, India
Samarjeet Borah
Department of Computer Science and Engineering, MEF Group of Institutions, Rajkot, Gujarat, India
Nitul Dutta

About this paper

Cite this paper

Chakraborty, C., Talukdar, P.H. (2019). HMM-Based Speaker Gender Recognition for Bodo Language. In: Sarma, H., Borah, S., Dutta, N. (eds) Advances in Communication, Cloud, and Big Data. Lecture Notes in Networks and Systems, vol 31. Springer, Singapore. https://doi.org/10.1007/978-981-10-8911-4_15

DOI: https://doi.org/10.1007/978-981-10-8911-4_15
Published: 16 June 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8910-7
Online ISBN: 978-981-10-8911-4
eBook Packages: Engineering, Engineering (R0)
