Continuously Variable Transition Probability HMM for Speech Recognition

Falaschi, Alessandro

doi:10.1007/978-3-642-76626-8_14

Part of the book series: NATO ASI Series (NATO ASI F, volume 75)

Abstract

A new model with intrinsic duration modelling is presented for improved speech recognition with HMM techniques. Assuming that the state self-loop probability decays exponentially with time, the duration density can be factorized and an early path pruning theorem proved. As a consequence, computational complexity is greatly reduced with respect to explicit duration models, while recognition performance improves considerably.
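
As a rough illustration of the idea in the abstract, the sketch below shows how a time-dependent self-loop probability induces a state duration distribution. The parametrization `a0 * exp(-decay * k)`, the parameter values, and the function names are assumptions chosen for illustration; the chapter's actual formulation is not visible in this preview and may differ.

```python
import math

def loop_probability(k, a0=0.9, decay=0.1):
    """Hypothetical self-loop probability after k frames spent in the state.

    Assumed form: a0 * exp(-decay * k). With decay = 0 this is the
    constant loop probability of a standard HMM.
    """
    return a0 * math.exp(-decay * k)

def duration_pmf(max_len, a0=0.9, decay=0.1):
    """Return P(duration = n) for n = 1..max_len.

    A duration of n frames means the state loops n - 1 times and then exits.
    """
    pmf = []
    stay = 1.0  # probability of having looped through the first n - 1 frames
    for n in range(1, max_len + 1):
        a_n = loop_probability(n, a0, decay)
        pmf.append(stay * (1.0 - a_n))  # exit at frame n
        stay *= a_n                     # or loop once more
    return pmf

if __name__ == "__main__":
    for n, p in enumerate(duration_pmf(10), start=1):
        print(f"P(d = {n:2d}) = {p:.4f}")
```

Under this assumed form the product of loop probabilities collapses to `a0**(n-1) * exp(-decay * n * (n-1) / 2)`, so the duration probability has a closed form in `n`. Setting `decay=0` recovers the geometric duration distribution of a standard HMM.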

This work has been partially funded by ALCATEL-FACE. Only the author is responsible for the ideas and conclusions reported here.

Author information

Authors and Affiliations

INFO-COM Department, La Sapienza University, Via Eudossiana 18, 00184, Roma, Italy
Alessandro Falaschi

Editor information

Editors and Affiliations

Dipartimento di Automatica e Informatica, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129, Torino, Italy
Pietro Laface
School of Computer Science, 3480 University St., Montreal, Quebec, H3A 2A7, Canada
Renato De Mori

Cite this paper

Falaschi, A. (1992). Continuously Variable Transition Probability HMM for Speech Recognition. In: Laface, P., De Mori, R. (eds) Speech Recognition and Understanding. NATO ASI Series, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76626-8_14

DOI: https://doi.org/10.1007/978-3-642-76626-8_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-76628-2
Online ISBN: 978-3-642-76626-8
eBook Packages: Springer Book Archive
