Abstract
Speaker adaptation is a technique for improving the recognition accuracy of Automatic Speech Recognition (ASR) systems. We report a study of the impact of online speaker adaptation on the performance of a speaker-independent, continuous speech recognition system for the Hindi language. Speaker adaptation is performed using the Maximum Likelihood Linear Regression (MLLR) transformation approach. The ASR system was trained using narrowband speech. The efficacy of speaker adaptation is studied using an unrelated speech database. MLLR transform-based speaker adaptation is found to significantly improve the accuracy of the Hindi ASR system, by 3%.
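As an illustration of the MLLR idea mentioned above: in the standard formulation of MLLR mean adaptation, each Gaussian mean vector μ of the acoustic model is replaced by an affine transform Aμ + b, where A and b are estimated from the new speaker's adaptation data. The sketch below shows only how such a transform would be applied once it is known; it is not the implementation used in this study, and the function name, the matrix A, the bias b, and the example means are all hypothetical values chosen for illustration.

```python
def apply_mllr_mean_transform(means, A, b):
    """Return the adapted means A*mu + b for each Gaussian mean vector mu.

    means: list of mean vectors (each a list of floats)
    A:     square transform matrix as a list of rows (assumed already estimated)
    b:     bias vector (assumed already estimated)
    """
    adapted = []
    for mu in means:
        adapted.append([
            sum(a_ij * m_j for a_ij, m_j in zip(row, mu)) + b_i
            for row, b_i in zip(A, b)
        ])
    return adapted


# Hypothetical 2-dimensional example; real acoustic features have many more dimensions.
means = [[1.0, 2.0], [0.0, -1.0]]
A = [[1.0, 0.5], [0.0, 2.0]]
b = [0.5, -1.0]
adapted_means = apply_mllr_mean_transform(means, A, b)
```

Because a single transform is shared across many Gaussians, relatively little adaptation speech is needed to estimate it, which is what makes the approach suitable for online adaptation. Estimating A and b from adaptation data is not shown here.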
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sivaraman, G., Mehta, S., Nabar, N., Samudravijaya, K. (2011). Higher Accuracy of Hindi Speech Recognition Due to Online Speaker Adaptation. In: Shah, K., Lakshmi Gorty, V.R., Phirke, A. (eds) Technology Systems and Management. Communications in Computer and Information Science, vol 145. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20209-4_33
DOI: https://doi.org/10.1007/978-3-642-20209-4_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20208-7
Online ISBN: 978-3-642-20209-4
eBook Packages: Computer Science, Computer Science (R0)