Abstract
In this paper a real time prototype dedicated to single word recognition in ISDN lines is described. This system is speaker independent for a fixed hierarchical command set of totally 61 words. Context dependent continuous density Markov phoneme models are used. To improve recognition rates, a postprocessor based on information measures is proposed, which chooses the best word candidate in respect to transinformation.
In the first part the used speech recognition algorithms are presented. The second part deals with the ISDN speech database, the recording conditions and the achieved recognition rates. In the last part the hardware configuration of the speech server and the implementation of the described algorithms is explained in more detail. An outlook to future work concludes this contribution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
A. Aktas and H. Höge. Multi-DSP and VQ-ASIC based acoustic front-end for real-time speech processing tasks. In Proceedings of Eurospeech, pages 586 – 589, Paris, September 1989. EUROSPEECH.
A. Aktas and H. Höge. Real-time recognition of subword units on a hybrid multi-DSP/ASIC based acoustic front-end. In IEEE International Conference on Acoustics, Speech and Signal Processing, pages 101 – 103, Edinburgh, May 1989. ICASSP.
H. Ney. A script-guided algorithm for the automatic segmentation of continuous speech. In IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1209 – 1212, Tampa, 1985. ICASSP.
H. Ney, D. Mergel, A. Noll, and A. Paeseler. Recent Advances in Speech Understanding and Dialog Systems, volume F46 of NATO ASI Series, chapter Overview of Speech recognition in the Spicos System, pages 305 – 309. Springer, Berlin Heidelberg, 1988.
H. Ney and A. Noll. Phoneme modeling using continuous mixture densities. In IEEE International Conference on Acoustics, Speech and Signal Processing, pages 437 – 440, New York, April 1988. ICASSP.
T. R. Vilmansen. Feature evaluation with measures of probabilistic dependence. IEEE Transactions on Computers, 22:381 – 388, April 1973.
K. Zünkler. Speech-understanding systems: The communication technology of tomorrow. In H. Schwärtzel and I. Mizin, editors, Advanced Information Processing, Proceedings of a Joint Symposium Information Processing and Software, Systems Design Automation, pages 227 – 251. Springer, Berlin Heidelberg New York, June 1990.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1992 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zünkler, K. (1992). An ISDN speech server based on speaker independent continuous Hidden Markov Models. In: Laface, P., De Mori, R. (eds) Speech Recognition and Understanding. NATO ASI Series, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76626-8_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-76626-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-76628-2
Online ISBN: 978-3-642-76626-8
eBook Packages: Springer Book Archive