Speaker Adaptation of CDHMMs Using Bayesian Learning

Vair, Claudio; Fissore, Luciano

doi:10.1007/978-3-642-60087-6_7

Claudio Vair² &
Luciano Fissore²

Part of the book series: NATO ASI Series ((NATO ASI F,volume 169))

228 Accesses

Summary

We investigate the Bayesian Learning approach (also known as Maximum A Posteriori — MAP) to the speaker adaptation of Continuous Density Hidden Markov Models (CDHMMs). The parameters of the Gaussian mixture output densities are adapted using the exponential forgetting mechanism and performing the a priori parameter estimation in a model based outline. Moreover a channel adaptation is carried out by means of the cepstral mean normalization method (CMN).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

L. Fissore, F. Ravera, and P. Laface. Acoustic-phonetic modeling for flexible vocabulary speech recognition. In Proc. of EUROSPEECH, pages 1–799-802. Madrid, Spain, 1995.
Google Scholar
J.-L. Gauvain and C.-H. Lee. Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains. IEEE Trans. on Speech and Audio Processing, 2 (2): 291–298, Apr. 1992.
Article Google Scholar
Q. Huo and C.-H. Lee. A study of on-line quasi-bayes adaptation for cdhmm-based speech recognition. In Proc. of lCASSP, pages II–705–708. Atlanta, 1996.
Google Scholar
Y. Zhao. Self-learning speaker and channel adaptation based on spectral variation source decomposition. Speech Communication, 18: 65–77, Jan. 1996.
Article Google Scholar

Download references

Author information

Authors and Affiliations

CSELT — Centro Studi E Laboratori Telecomunicazioni, Via G. Reiss Romoli 274, 10148, Torino, Italy
Claudio Vair & Luciano Fissore

Authors

Claudio Vair
View author publications
You can also search for this author in PubMed Google Scholar
Luciano Fissore
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Speech Research Unit, DERA Malvern, St. Andrew’s Road, WR14 4DT, Great Malvern, Worcs, UK
Keith Ponting

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Vair, C., Fissore, L. (1999). Speaker Adaptation of CDHMMs Using Bayesian Learning. In: Ponting, K. (eds) Computational Models of Speech Pattern Processing. NATO ASI Series, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60087-6_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-60087-6_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-64250-0
Online ISBN: 978-3-642-60087-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics