Overview of Speech Recognition in the ‘SPICOS’ System

  • H. Ney
  • D. Mergel
  • A. Noll
  • A. Paeseler
Part of the NATO ASI Series book series (volume 46)

Abstract

In this paper, a recognition technique used in the ‘SPICOS’ project is described. It is based on an integrated approach that combines the various knowledge sources, such as inventory of subword unit, pronunciation lexicon and language model, during the process of decision making in order to improve the reliability of the acoustic recognition. The recognition decision amounts to a search through a large state space with delayed decisions. The speaker dependent recognition tests are performed on a speech data base comprising 3 sessions of each of 5 speakers. A session consists of 200 sentences and amounts to 1391 word samples.

Keywords

Acoustics 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. J.K. BAKER (1975): “Stochastic Modeling for Automatic Speech Understanding”, in D.R. REDDY (ed.): ‘Speech Recognition’, Academic Press, New York, pp.512–542, 1975.Google Scholar
  2. F. JELINEK (1976): “Continuous Speech Recognition by Statistical Methods”, Proc. of the IEEE, Vol. 64, No. 10, pp.532–556, April 1976.CrossRefGoogle Scholar
  3. D. MERGEL, A. PAESELER (1987): “Construction of Language Models for Spoken Data Base Queries”, Proc. 1 987 IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Dallas, Texas, pp. 20.13.1–4, April 1987.Google Scholar
  4. H. NEY (1984): “The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition”, IEEE Trans. on Acoustics, Speech and Signal, Vol. ASSP-32, No. 2, pp.263–271, April 1984.CrossRefGoogle Scholar
  5. H. NEY, D. MERGEL, A. NOLL, P. PAESELER (1987): “A Data-Driven Organization of the Dynamic Programming Beam Search for Continuous Speech Recognition”, Proc. 1987 IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Dallas, Texas, pp. 20.10.1–4, April 1987.Google Scholar
  6. A. NOLL, H. NEY (1987): “Training of Phoneme Models in a Sentence Recognition System”, Proc. 1987 IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Dallas, Texas, pp.29.6.1–4, April 1987.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1988

Authors and Affiliations

  • H. Ney
    • 1
  • D. Mergel
    • 1
  • A. Noll
    • 1
  • A. Paeseler
    • 1
  1. 1.Forschungslaboratorium HamburgPhilips GmbHHamburg 54West Germany

Personalised recommendations