Abstract
Voice recognition systems perform well when the speech signal is recorded in good conditions, that is, with a low noise level and good microphones. Their results are not sufficient, however, in several noisy real-life situations (e.g., in cars). The aim of the presented work was to compare three spectral parametrisation techniques in terms of speech recognition performance and, more precisely, to evaluate their robustness to noise. This study was part of a French GRECO project on the comparison of parametric and non-parametric spectral analysis methods for speech recognition. The project used the existing speech recognition program SAMREC-1 with the EUROMO speech database and the RSG_10 noise database. The techniques are evaluated by their recognition scores in lexical access, for speaker-dependent isolated word recognition based on Dynamic Time Warping.
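To illustrate the kind of recogniser the abstract refers to, here is a minimal sketch of isolated word recognition by Dynamic Time Warping. It is not the SAMREC-1 implementation: the function names (`frame_distance`, `dtw_distance`, `recognize`), the Euclidean frame distance, the unconstrained step pattern and the toy one-dimensional features are all assumptions made for illustration. In the study itself, the frames would be vectors produced by one of the spectral parametrisations under comparison.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature vectors (assumed local distance)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(ref, test):
    """Accumulated cost of the best time alignment between two frame sequences."""
    n, m = len(ref), len(test)
    inf = float("inf")
    # cost[i][j] = best accumulated cost aligning ref[:i] with test[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(ref[i - 1], test[j - 1])
            # Allowed moves: advance in ref only, in test only, or in both.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def recognize(templates, utterance):
    """Return the label of the stored template with the lowest DTW cost."""
    return min(templates, key=lambda label: dtw_distance(templates[label], utterance))


# Hypothetical speaker-dependent templates: one reference sequence per word.
templates = {
    "yes": [[0.0], [1.0], [2.0]],
    "no": [[5.0], [5.0], [4.0]],
}
# A slower rendition of "yes": the first frame is repeated.
utterance = [[0.0], [0.0], [1.0], [2.0]]
best_label = recognize(templates, utterance)
```

Because the warping absorbs differences in speaking rate, the comparison between parametrisations then rests on how well the frame-level distance separates words once noise is added to the test utterances.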
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Baudoin, G., Jardin, P., Gross, J., Chollet, G. (1995). Comparison of Parametric Spectral Representations for Voice Recognition in Noisy Environments. In: Ayuso, A.J.R., Soler, J.M.L. (eds) Speech Recognition and Coding. NATO ASI Series, vol 147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-57745-1_45
DOI: https://doi.org/10.1007/978-3-642-57745-1_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-63344-7
Online ISBN: 978-3-642-57745-1
eBook Packages: Springer Book Archive