Abstract
Speaker normalization is known to be an effective method for improving recognition accuracy in speaker-independent speech recognition systems. This paper proposes a new power spectrum warping approach that improves speaker normalization beyond what frequency warping achieves. The power spectrum warping is applied at the Mel filter bank stage of the MFCC computation. This paper also proposes a hybrid VTN that combines power spectrum warping with frequency warping. The experiments compare recognition performance on the SKKU PBW DB: compared with the word recognition performance of the baseline system, power spectrum warping reduces the word error rate by 3.06%, and the hybrid VTN reduces it by 4.07%.
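The abstract does not give the warping functions themselves. As a point of reference only, the frequency-warping side of VTN is commonly realized as a piecewise-linear rescaling of the frequency axis by a speaker-specific factor. The sketch below assumes that common form; the function name `warp_frequency`, the `knee` parameter, and the example values are hypothetical and are not taken from the paper, and the sketch does not represent the paper's power spectrum warping.

```python
def warp_frequency(f_hz, alpha, f_max_hz, knee=0.875):
    """Piecewise-linear frequency warp (illustrative assumption, not the paper's method).

    Frequencies below a break point are scaled by alpha; above it, a second
    linear segment maps the rest of the band so that f_max_hz stays fixed.
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    if not 0.0 <= f_hz <= f_max_hz:
        raise ValueError("f_hz must lie in [0, f_max_hz]")

    # Lower the break point when alpha > 1 so alpha * f0 never exceeds f_max_hz.
    f0 = knee * f_max_hz / max(1.0, alpha)
    if f_hz <= f0:
        return alpha * f_hz

    # Second segment joins (f0, alpha * f0) to (f_max_hz, f_max_hz).
    slope = (f_max_hz - alpha * f0) / (f_max_hz - f0)
    return alpha * f0 + slope * (f_hz - f0)


if __name__ == "__main__":
    # Hypothetical Mel filter centre frequencies in Hz for an 8 kHz band.
    centres = [500.0, 1000.0, 2000.0, 4000.0, 7500.0]
    for alpha in (0.90, 1.00, 1.10):
        warped = [round(warp_frequency(f, alpha, 8000.0), 1) for f in centres]
        print(alpha, warped)
```

In a frequency-warping front end of this kind, the warp would typically be applied to the filter bank centre frequencies (or, equivalently, to the spectrum) before the MFCCs are computed, with the factor chosen per speaker.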
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Roh, YW., Kim, JH., Kim, DJ., Hong, KS. (2006). A Hybrid Warping Method Approach to Speaker Warping Adaptation. In: Bloch, I., Petrosino, A., Tettamanzi, A.G.B. (eds) Fuzzy Logic and Applications. WILF 2005. Lecture Notes in Computer Science, vol 3849. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11676935_18
DOI: https://doi.org/10.1007/11676935_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32529-1
Online ISBN: 978-3-540-32530-7
eBook Packages: Computer Science, Computer Science (R0)