Demisyllables as Processing Units for Automatic Speech Recognition and Lexical Access

  • Conference paper

Part of the book series: NATO ASI Series (NATO ASI F, volume 16)

Abstract

This paper describes a number of experimental investigations into syllable-based acoustic-phonetic analysis of German words. These methods can serve as a basic processing stage in a system for automatic speech recognition as well as for speech understanding. The importance of the syllable in speech processing by man and machine is discussed first. Several methods and experiments are then presented, covering segmentation into syllables and recognition of vowels and consonant clusters, as well as two methods for lexical access and lexical search using these units. The lexicon search is needed to find the word in a word list that corresponds to the recognized units, or alternatively to determine the most similar word. The most salient feature of this system is its use of so-called demisyllables as the processing units.
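
The lexical search step can be illustrated with a minimal sketch. It is not the paper's method: the abstract does not specify the two lexical access procedures, so the edit distance used here is a swapped-in, generic similarity measure. The toy lexicon, the demisyllable labels, and the function names `edit_distance` and `lexical_access` are all hypothetical.

```python
from typing import Dict, Sequence, Tuple

# Hypothetical toy lexicon: each word is stored as a sequence of demisyllable
# labels (initial consonant cluster + vowel, then vowel + final consonant
# cluster). The labels are illustrative and not taken from the paper.
LEXICON: Dict[str, Tuple[str, ...]] = {
    "Hand": ("ha", "ant"),
    "Band": ("ba", "ant"),
    "Hast": ("ha", "ast"),
}


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance counted in whole demisyllable units."""
    prev = list(range(len(b) + 1))
    for i, unit_a in enumerate(a, start=1):
        curr = [i]
        for j, unit_b in enumerate(b, start=1):
            cost = 0 if unit_a == unit_b else 1
            curr.append(min(prev[j] + 1,          # delete a unit from a
                            curr[j - 1] + 1,      # insert a unit from b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def lexical_access(
    units: Sequence[str],
    lexicon: Dict[str, Tuple[str, ...]] = LEXICON,
) -> Tuple[str, int]:
    """Return (word, distance) for the closest entry; distance 0 is an exact match."""
    return min(
        ((word, edit_distance(units, entry)) for word, entry in lexicon.items()),
        key=lambda pair: pair[1],
    )


if __name__ == "__main__":
    print(lexical_access(("ha", "ant")))  # exact match in the word list
    print(lexical_access(("ba", "ans")))  # misrecognized unit, nearest word
```

An exact match comes back with distance 0, which corresponds to finding the word in the word list. Any other result is the most similar word under this assumed measure; when several entries are equally close, the sketch simply returns the first one in lexicon order.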

Copyright information

© 1985 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ruske, G. (1985). Demisyllables as Processing Units for Automatic Speech Recognition and Lexical Access. In: De Mori, R., Suen, C.Y. (eds) New Systems and Architectures for Automatic Speech Recognition and Synthesis. NATO ASI Series, vol 16. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-82447-0_24

  • DOI: https://doi.org/10.1007/978-3-642-82447-0_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-82449-4

  • Online ISBN: 978-3-642-82447-0

  • eBook Packages: Springer Book Archive