Abstract
Used for acoustic-phonetic decoding, hidden Markov models (HMM) perform well, but they induce phoneme boundaries problems; furthermore, the inter-speaker variability makes probability distributions difficult to learn. Two efficient signal processing techniques are used in a HMM-based phonetic decoding system: the Forward-Backward Divergence method detecting discontinuities of the speech signal, which are used to constrain the phonetic transitions between the models; the auto-regressive vector model allowing a definition of a speaker topology, which improves the quality of the training set. A new decoding system using these two processings is compared with a standard HMM-based system on the TIMIT database. -A significant 4% improvement of the accuracy rate is observed on the 7000 test phonemes.
Thanks to R. André-Obrecht for her software performing the FBD method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. André-Obrecht, “A New Statistical Approach for the Automatic Segmentation of Continuous Speech Signals”, IEEE Trans. ASSP, vol. 36, pp. 29–40, 1988.
C. Barras, M.-J. Caraty, P. Deléglise, C. Montacié, R. André-Obrecht & X. Rodet, “Décomposition Temporelle et Ruptures de Modèles pour le Décodage Acoustico-Phonétique”, 19th JEP, pp. 335–340, 1992.
C. Montacié& J.-L. Le Floch, “Discriminant AR-Vector Models for Free-Text Speaker Verification”, Eurospeech, pp. 161–164, Berlin, 1993.
K.-F. Lee & H.-W. Hon, “Speaker-Independent Phone Recognition using Hidden Markov Models”, IEEE Trans. ASSP, vol. 37, pp. 1641–1648, 1989.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Barras, C., Caraty, MJ., Montacié, C. (1995). HMM Based Acoustic-Phonetic Decoding with Constrained Transitions and Speaker Topology. In: Ayuso, A.J.R., Soler, J.M.L. (eds) Speech Recognition and Coding. NATO ASI Series, vol 147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-57745-1_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-57745-1_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-63344-7
Online ISBN: 978-3-642-57745-1
eBook Packages: Springer Book Archive