An ISDN speech server based on speaker independent continuous Hidden Markov Models

Zünkler, Klaus

doi:10.1007/978-3-642-76626-8_11

Klaus Zünkler³

Part of the book series: NATO ASI Series ((NATO ASI F,volume 75))

277 Accesses

Abstract

In this paper a real time prototype dedicated to single word recognition in ISDN lines is described. This system is speaker independent for a fixed hierarchical command set of totally 61 words. Context dependent continuous density Markov phoneme models are used. To improve recognition rates, a postprocessor based on information measures is proposed, which chooses the best word candidate in respect to transinformation.

In the first part the used speech recognition algorithms are presented. The second part deals with the ISDN speech database, the recording conditions and the achieved recognition rates. In the last part the hardware configuration of the speech server and the implementation of the described algorithms is explained in more detail. An outlook to future work concludes this contribution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

A. Aktas and H. Höge. Multi-DSP and VQ-ASIC based acoustic front-end for real-time speech processing tasks. In Proceedings of Eurospeech, pages 586 – 589, Paris, September 1989. EUROSPEECH.
Google Scholar
A. Aktas and H. Höge. Real-time recognition of subword units on a hybrid multi-DSP/ASIC based acoustic front-end. In IEEE International Conference on Acoustics, Speech and Signal Processing, pages 101 – 103, Edinburgh, May 1989. ICASSP.
Chapter Google Scholar
H. Ney. A script-guided algorithm for the automatic segmentation of continuous speech. In IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1209 – 1212, Tampa, 1985. ICASSP.
Google Scholar
H. Ney, D. Mergel, A. Noll, and A. Paeseler. Recent Advances in Speech Understanding and Dialog Systems, volume F46 of NATO ASI Series, chapter Overview of Speech recognition in the Spicos System, pages 305 – 309. Springer, Berlin Heidelberg, 1988.
Google Scholar
H. Ney and A. Noll. Phoneme modeling using continuous mixture densities. In IEEE International Conference on Acoustics, Speech and Signal Processing, pages 437 – 440, New York, April 1988. ICASSP.
Google Scholar
T. R. Vilmansen. Feature evaluation with measures of probabilistic dependence. IEEE Transactions on Computers, 22:381 – 388, April 1973.
Article MATH MathSciNet Google Scholar
K. Zünkler. Speech-understanding systems: The communication technology of tomorrow. In H. Schwärtzel and I. Mizin, editors, Advanced Information Processing, Proceedings of a Joint Symposium Information Processing and Software, Systems Design Automation, pages 227 – 251. Springer, Berlin Heidelberg New York, June 1990.
Google Scholar

Download references

Author information

Authors and Affiliations

Corporate Research and Development, Siemens AG, Munich, W-Germany
Klaus Zünkler

Authors

Klaus Zünkler
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Automatica e Informatica, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129, Torino, Italy
Pietro Laface
School of Computer Science, 3480 University St., Montreal, Quebec, H3A 2A7, Canada
Renato De Mori

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zünkler, K. (1992). An ISDN speech server based on speaker independent continuous Hidden Markov Models. In: Laface, P., De Mori, R. (eds) Speech Recognition and Understanding. NATO ASI Series, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76626-8_11

Download citation

DOI: https://doi.org/10.1007/978-3-642-76626-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-76628-2
Online ISBN: 978-3-642-76626-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics