
A Hybrid Warping Method Approach to Speaker Warping Adaptation

  • Conference paper
Fuzzy Logic and Applications (WILF 2005)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3849)


Abstract

Speaker normalization is known to be an effective way to improve recognition accuracy in speaker-independent speech recognition systems. This paper proposes a new power spectrum warping approach that aims to improve speaker normalization beyond what frequency warping achieves. The power spectrum warping operates on the Mel filter bank used in MFCC (Mel-frequency cepstral coefficient) extraction. The paper also proposes a hybrid VTN (vocal tract normalization) that combines power spectrum warping with frequency warping. Experiments compared the recognition performance of each method on the SKKU PBW DB: relative to the word recognition performance of the baseline system, power spectrum warping reduced the word error rate by 3.06% and hybrid VTN by 4.07%.
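The abstract does not give the details of the proposed power spectrum warping, so the sketch below illustrates only the conventional frequency warping that it is compared against: a piecewise-linear warp applied to the edge frequencies of a Mel filter bank. Everything here is an assumption for illustration, not the authors' implementation: the function names, the `knee` breakpoint parameter, and the common Mel formula 2595 log10(1 + f/700).

```python
import numpy as np

def hz_to_mel(f):
    # Common Mel-scale convention (assumed; the paper's exact formula is not given).
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)

def warp_frequency(f, alpha, f_max, knee=0.85):
    """Piecewise-linear warp: scale by alpha up to a breakpoint, then follow
    a straight line that ends at (f_max, f_max) so the bandwidth is preserved."""
    f = np.asarray(f, dtype=float)
    f0 = knee * f_max / max(alpha, 1.0)  # breakpoint; keeps alpha * f0 below f_max
    upper = alpha * f0 + (f_max - alpha * f0) * (f - f0) / (f_max - f0)
    return np.where(f <= f0, alpha * f, upper)

def warped_mel_filterbank(n_filters, n_fft, sample_rate, alpha=1.0):
    """Triangular Mel filter bank whose edge frequencies are warped by alpha."""
    f_max = sample_rate / 2.0
    # n_filters + 2 edge points, equally spaced on the Mel scale.
    edges_hz = mel_to_hz(np.linspace(0.0, hz_to_mel(f_max), n_filters + 2))
    edges_hz = warp_frequency(edges_hz, alpha, f_max)
    bin_hz = np.linspace(0.0, f_max, n_fft // 2 + 1)  # FFT bin centre frequencies
    bank = np.zeros((n_filters, bin_hz.size))
    for i in range(n_filters):
        lo, mid, hi = edges_hz[i], edges_hz[i + 1], edges_hz[i + 2]
        rising = (bin_hz - lo) / (mid - lo)
        falling = (hi - bin_hz) / (hi - mid)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

# Example: 24 filters over a 512-point FFT at 16 kHz, warped by a hypothetical factor.
bank = warped_mel_filterbank(24, 512, 16000, alpha=1.1)
```

In a VTN setup of this kind, the warping factor `alpha` is typically chosen per speaker from a small grid of candidate values, and the filter bank output for a frame is the product of `bank` with that frame's power spectrum.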

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Roh, YW., Kim, JH., Kim, DJ., Hong, KS. (2006). A Hybrid Warping Method Approach to Speaker Warping Adaptation. In: Bloch, I., Petrosino, A., Tettamanzi, A.G.B. (eds) Fuzzy Logic and Applications. WILF 2005. Lecture Notes in Computer Science, vol 3849. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11676935_18

  • DOI: https://doi.org/10.1007/11676935_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32529-1

  • Online ISBN: 978-3-540-32530-7

  • eBook Packages: Computer Science, Computer Science (R0)