Skip to main content

FLC-Regulated Speaker Adaptation Mechanisms for Speech Recognition

  • Conference paper
Computational Collective Intelligence. Technologies and Applications (ICCCI 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6422))

Included in the following conference series:

  • 1037 Accesses

Abstract

The exploitation of fuzzy logic control (FLC) mechanism in the fields of speaker adaptation (SA) is thoroughly investigated in this study, specifically in the reliable determination of HMM acoustic parameters. For enhancing the performance of speaker adaptation, the FLC mechanism is engineered into the MAP estimate of HMM parameters for Bayesian-based adaptation; also into the MLLR estimate for transformation-based adaptation. The speech recognition system using an adaptation scheme with the support of FLC will still be able to keep a satisfactory recognition performance even in an ordinary case.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rabiner, L.R.: The Power of Speech. Science 301, 1494–1495 (2003)

    Article  Google Scholar 

  2. Lippmann, R.P.: Speech Recognition by Machines and Humans. Speech Communication 22, 1–15 (1997)

    Article  Google Scholar 

  3. Kuhn, R., Junqua, J.-C., Nguyen, P., Niedzielski, N.: Rapid Speaker Adaptation in Eigenvoice Space. IEEE Transactions on Speech and Audio Processing 8, 695–707 (2000)

    Article  Google Scholar 

  4. Mak, B., Hsiao, R.: Kernel Eigenspace-based MLLR Adaptation. IEEE Transactions on Audio, Speech, and Language Processing 15, 784–795 (2007)

    Article  Google Scholar 

  5. Kermiche, S., Saidi, M.L., Abbassi, H.A., Ghodbane, H.: Takagi-Sugeno Based Controller for Mobile Robot Navigation. Journal of Applied Science 6, 1838–1844 (2006)

    Article  Google Scholar 

  6. Gauvain, J.L., Lee, C.H.: Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains. IEEE Transactions on Speech and Audio Processing 2, 291–298 (1994)

    Article  Google Scholar 

  7. Lee, C.H., Lin, C.H., Juang, B.H.: A Study on Speaker Adaptation of the Parameters of Continuous Density Hidden Markov Models. IEEE Transactions on Acoustics, Speech and Signal Processing 39, 806–814 (1991)

    Article  Google Scholar 

  8. Takahashi, J.-I., Sagayama, S.: Vector-field-smoothed Bayesian Learning for Fast and Incremental Speaker/Telephone-channel Adaptation. Computer Speech and Language 11, 127–146 (1997)

    Article  Google Scholar 

  9. Woodland, P.C.: Speaker Adaptation: Techniques and Challenges. In: Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 85–90 (1999)

    Google Scholar 

  10. Takagi, T., Sugeno, M.: Fuzzy Identification of Systems and Its Application to Modeling and Control. IEEE Transactions on Systems, Man and Cybernetics 15, 116–132 (1985)

    Article  MATH  Google Scholar 

  11. Leggetter, C.J., Woodland, P.C.: Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models. Computer Speech and Language 9, 171–185 (1995)

    Article  Google Scholar 

  12. Chien, J.T., Lee, L.M., Wang, H.C.: Estimation of Channel Bias for Telephone Speech Recognition. In: Proceedings of International Conference on Spoken Language Processing, pp. 1840–1843 (1996)

    Google Scholar 

  13. Chien, J.T., Wang, H.C.: Telephone Speech Recognition Based on Bayesian Adaptation of Hidden Markov Models. Speech Communication 22, 369–384 (1997)

    Article  Google Scholar 

  14. Chesta, C., Siohan, O., Lee, C.H.: Maximum a Posteriori Linear Regression for Hidden Markov Model Adaptation. In: Proceedings of the European Conference on Speech Communication and Technology, pp. 211–214 (1999)

    Google Scholar 

  15. Chou, W.: Maximum a Posteriori Linear Regression with Elliptically Symmetric Matrix Priors. In: Proceedings of the European Conference on Speech Communication and Technology, pp. 1–4 (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ding, IJ. (2010). FLC-Regulated Speaker Adaptation Mechanisms for Speech Recognition. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2010. Lecture Notes in Computer Science(), vol 6422. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16732-4_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16732-4_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16731-7

  • Online ISBN: 978-3-642-16732-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics