FLC-Regulated Speaker Adaptation Mechanisms for Speech Recognition

Ding, Ing-Jr

doi:10.1007/978-3-642-16732-4_31

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6422)

Included in the following conference series:

International Conference on Computational Collective Intelligence

Abstract

This study investigates the use of a fuzzy logic control (FLC) mechanism in speaker adaptation (SA), specifically for the reliable determination of HMM acoustic parameters. To improve adaptation performance, the FLC mechanism is incorporated into the MAP estimate of HMM parameters for Bayesian adaptation, and into the MLLR estimate for transformation-based adaptation. A speech recognition system that uses an FLC-supported adaptation scheme is able to maintain satisfactory recognition performance even in an ordinary case.
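
As a rough illustration of where such a regulator could sit, the sketch below pairs the widely used MAP update of a single Gaussian mean with a two-rule, Takagi-Sugeno-style function that chooses the prior weight from the amount of adaptation data. It is not the chapter's algorithm: the function names `map_mean` and `fuzzy_tau`, the membership breakpoints, and the two output levels are all assumptions made for illustration, and the choice of data occupancy as the fuzzy input is likewise assumed.

```python
import numpy as np

def map_mean(prior_mean, frames, posteriors, tau):
    """MAP update of one Gaussian mean:
    mu = (tau * mu_prior + sum_t gamma_t * x_t) / (tau + sum_t gamma_t)
    """
    prior_mean = np.asarray(prior_mean, dtype=float)
    frames = np.asarray(frames, dtype=float)          # shape (T, D)
    posteriors = np.asarray(posteriors, dtype=float)  # shape (T,)
    occupancy = posteriors.sum()                      # soft count of frames
    weighted_sum = posteriors @ frames                # shape (D,)
    return (tau * prior_mean + weighted_sum) / (tau + occupancy)

def fuzzy_tau(occupancy, low=5.0, high=50.0, tau_sparse=20.0, tau_rich=2.0):
    """Hypothetical two-rule regulator (all constants are assumed values).
    Rule 1: IF data is sparse THEN tau = tau_sparse (trust the prior).
    Rule 2: IF data is rich   THEN tau = tau_rich   (trust the data).
    The output is the membership-weighted average of the two rule outputs.
    """
    # Membership in "rich" rises linearly from 0 at `low` to 1 at `high`.
    rich = float(np.clip((occupancy - low) / (high - low), 0.0, 1.0))
    sparse = 1.0 - rich
    return sparse * tau_sparse + rich * tau_rich

# Usage: two adaptation frames, each fully assigned to this Gaussian.
frames = [[2.0, 2.0], [4.0, 4.0]]
gammas = [1.0, 1.0]
tau = fuzzy_tau(sum(gammas))  # little data, so the prior weight stays large
adapted = map_mean([0.0, 0.0], frames, gammas, tau)
```

With only two frames the regulator returns its largest prior weight, so the adapted mean stays close to the prior mean; as the occupancy grows past the assumed upper breakpoint, the update moves toward the sample mean of the adaptation data.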

Author information

Authors and Affiliations

Department of Electrical Engineering, National Formosa University, No.64, Wunhua Rd., Huwei Township, Yunlin County, 632, Taiwan, R.O.C.
Ing-Jr Ding

Editor information

Editors and Affiliations

Department of Electronic Engineering, National Kaohsiung University of Applied Sciences, 415 Chien-Kung Road, 807, Kaohsiung, Taiwan
Jeng-Shyang Pan
Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, #43, Sec.4, Keelung Rd., 106, Taipei, Taiwan
Shyi-Ming Chen
Institute of Informatics, Wroclaw University of Technology, Str. Wyb. Wyspianskiego 27, 50-370, Wroclaw, Poland
Ngoc Thanh Nguyen

About this paper

Cite this paper

Ding, IJ. (2010). FLC-Regulated Speaker Adaptation Mechanisms for Speech Recognition. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2010. Lecture Notes in Computer Science, vol 6422. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16732-4_31

DOI: https://doi.org/10.1007/978-3-642-16732-4_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16731-7
Online ISBN: 978-3-642-16732-4
eBook Packages: Computer Science, Computer Science (R0)
