An ISDN speech server based on speaker independent continuous Hidden Markov Models

  • Conference paper
Speech Recognition and Understanding

Part of the book series: NATO ASI Series (NATO ASI F, volume 75)

Abstract

This paper describes a real-time prototype dedicated to single-word recognition over ISDN lines. The system is speaker independent for a fixed hierarchical command set of 61 words in total. Context-dependent continuous-density Markov phoneme models are used. To improve recognition rates, a postprocessor based on information measures is proposed, which chooses the best word candidate with respect to transinformation (mutual information).

The first part presents the speech recognition algorithms used. The second part deals with the ISDN speech database, the recording conditions, and the recognition rates achieved. The last part explains the hardware configuration of the speech server and the implementation of the described algorithms in more detail. An outlook on future work concludes this contribution.
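
The abstract names transinformation as the criterion used by the postprocessor but does not say how it is computed. As orientation only, the following is a minimal sketch of the quantity itself, assuming a hypothetical confusion-count matrix `counts[i][j]` (word `i` spoken, word `j` recognized). The function name `transinformation` and the matrix layout are illustrative assumptions, not the paper's implementation.

```python
import math


def transinformation(counts):
    """Mutual information I(X;Y) in bits from a joint count matrix.

    counts[i][j] is a hypothetical count of utterances where word i was
    spoken and word j was recognized (illustrative, not data from the paper).
    """
    total = sum(sum(row) for row in counts)
    if total == 0:
        raise ValueError("count matrix is empty")
    row_sums = [sum(row) for row in counts]
    col_sums = [sum(col) for col in zip(*counts)]
    info = 0.0
    for i, row in enumerate(counts):
        for j, n in enumerate(row):
            if n == 0:
                continue  # the term 0 * log(0) is taken as 0
            # p(x, y) * log2(p(x, y) / (p(x) * p(y))), written with raw counts
            info += (n / total) * math.log2(n * total / (row_sums[i] * col_sums[j]))
    return info
```

With two equally frequent words that are never confused, for example `[[5, 0], [0, 5]]`, the function returns 1 bit. When the recognized word is statistically independent of the spoken word, as in `[[1, 1], [1, 1]]`, it returns 0.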

Copyright information

© 1992 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zünkler, K. (1992). An ISDN speech server based on speaker independent continuous Hidden Markov Models. In: Laface, P., De Mori, R. (eds) Speech Recognition and Understanding. NATO ASI Series, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76626-8_11

  • DOI: https://doi.org/10.1007/978-3-642-76626-8_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-76628-2

  • Online ISBN: 978-3-642-76626-8

  • eBook Packages: Springer Book Archive
