Automatic adjustments of the Markov models topology for speech recognition applications over the telephone

Jouvet, Denis; Mauuary, Laurent; Monné, Jean

doi:10.1007/978-3-642-76626-8_4

Part of the book series: NATO ASI Series (NATO ASI F, volume 75)

Abstract

This paper presents automatic adjustments of the structure of Markov models, with the objective of either reducing model complexity or improving recognition performance. These modifications are tested on a 36-word vocabulary recorded by more than 500 speakers over the telephone network. Model complexity is reduced by merging similar Gaussian functions using an iterative procedure. A 40% reduction in the number of Gaussian functions is obtained on word-based models without altering recognition performance.
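
The abstract does not specify the similarity measure or the merging rule, so the following is only a minimal sketch of one way an iterative merge could look. It assumes one-dimensional Gaussians stored as hypothetical `(weight, mean, variance)` tuples, a variance-normalised squared mean gap as the distance, a moment-matched merge, and a user-chosen `threshold`; none of these choices come from the paper.

```python
from itertools import combinations


def merge_pair(a, b):
    """Moment-matched merge of two weighted 1-D Gaussians (weight, mean, var)."""
    wa, ma, va = a
    wb, mb, vb = b
    w = wa + wb
    mean = (wa * ma + wb * mb) / w
    # Weighted second moment of the two components, then back to a variance.
    second = (wa * (va + ma * ma) + wb * (vb + mb * mb)) / w
    return (w, mean, second - mean * mean)


def distance(a, b):
    """Assumed similarity measure: squared mean gap over the summed variances."""
    return (a[1] - b[1]) ** 2 / (a[2] + b[2])


def merge_similar(gaussians, threshold):
    """Repeatedly merge the closest pair until no pair is closer than threshold."""
    gaussians = list(gaussians)
    while len(gaussians) > 1:
        i, j = min(
            combinations(range(len(gaussians)), 2),
            key=lambda p: distance(gaussians[p[0]], gaussians[p[1]]),
        )
        if distance(gaussians[i], gaussians[j]) > threshold:
            break  # the closest remaining pair is no longer "similar"
        merged = merge_pair(gaussians[i], gaussians[j])
        gaussians = [g for k, g in enumerate(gaussians) if k not in (i, j)]
        gaussians.append(merged)
    return gaussians


pool = [(10.0, 0.0, 1.0), (10.0, 0.2, 1.0), (5.0, 8.0, 1.0)]
reduced = merge_similar(pool, threshold=0.5)
```

In this toy pool the first two components are merged and the distant third one is left alone. In a real system the models would presumably be re-estimated after merging, which this sketch leaves out.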

Recognition performance is improved by dynamically expanding the Markov model. This is achieved mainly by splitting the Gaussian functions that make the highest contribution to the observation probability of the training set and by discarding infrequently used transitions. After some iterations (involving the splitting and discarding operators), a 30% reduction of the word error rate is achieved using pseudo-diphone-based models.
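
The two operators named above can be illustrated with a rough sketch. The abstract gives no formulas, so everything here is an assumption: Gaussians are hypothetical `(weight, mean, variance)` tuples, `contributions` is a precomputed per-Gaussian score standing in for the contribution to the training-set observation probability, the split perturbs the mean by `eps` standard deviations, and a transition counts as "infrequently used" when its share of a state's outgoing count falls below `min_fraction`.

```python
import math


def split_top_gaussian(gaussians, contributions, eps=0.2):
    """Replace the highest-contributing Gaussian with two perturbed copies."""
    top = max(range(len(gaussians)), key=lambda k: contributions[k])
    w, mean, var = gaussians[top]
    shift = eps * math.sqrt(var)  # assumed perturbation size
    halves = [(w / 2, mean - shift, var), (w / 2, mean + shift, var)]
    return gaussians[:top] + halves + gaussians[top + 1:]


def prune_transitions(counts, min_fraction=0.01):
    """Drop rarely used transitions and renormalise the rest to probabilities.

    counts maps each state to a dict {next_state: usage count}.
    """
    pruned = {}
    for state, outgoing in counts.items():
        total = sum(outgoing.values())
        if total == 0:
            pruned[state] = {}  # state never visited: nothing to keep
            continue
        kept = {n: c for n, c in outgoing.items() if c >= min_fraction * total}
        kept_total = sum(kept.values())
        pruned[state] = {n: c / kept_total for n, c in kept.items()}
    return pruned


mixture = [(4.0, 0.0, 1.0), (6.0, 5.0, 4.0)]
expanded = split_top_gaussian(mixture, contributions=[0.3, 0.7])

usage = {"s1": {"s1": 60, "s2": 39, "s3": 1}}
transitions = prune_transitions(usage, min_fraction=0.05)
```

Each such step would normally be followed by retraining before the next split or pruning pass, consistent with the iterations the abstract mentions; the retraining itself is not shown.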

Author information

Authors and Affiliations

Centre National d’Etudes des Télécommunications, LAA/TSS/RCP, Route de Trégastel, 22300, Lannion, France
Denis Jouvet, Laurent Mauuary & Jean Monné

Editor information

Editors and Affiliations

Dipartimento di Automatica e Informatica, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129, Torino, Italy
Pietro Laface
School of Computer Science, 3480 University St., Montreal, Quebec, H3A 2A7, Canada
Renato De Mori

About this paper

Cite this paper

Jouvet, D., Mauuary, L., Monné, J. (1992). Automatic adjustments of the Markov models topology for speech recognition applications over the telephone. In: Laface, P., De Mori, R. (eds) Speech Recognition and Understanding. NATO ASI Series, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76626-8_4

DOI: https://doi.org/10.1007/978-3-642-76626-8_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-76628-2
Online ISBN: 978-3-642-76626-8
eBook Packages: Springer Book Archive