Skip to main content

Experiments on a Fast Mixture Density Likelihood Computation

  • Conference paper
  • 227 Accesses

Part of the book series: NATO ASI Series ((NATO ASI F,volume 147))

Abstract

A computationally very expensive task arising within speech recognition systems using continuous mixture densities is the log-likelihood computation, which in our system essentially amounts to a nearest-neighbor search. A powerful nearest-neighbor search procedure is the Approximating and Eliminating Search Algorithm (AESA) [1], based on the triangle inequality. However, a direct application of this method is prohibitive in our framework, which is characterized by a high-dimensional feature space and a small set of prototypes per class. Thus, an improved triangle-inequality algorithm adapted to our conditions was derived. Experimental tests show that it is 35% faster than a straightforward search and 20% faster than the standard partial-distantce algorithm used in the Philips large vocabulary continuous-speech recognition system. The discussion particularly focuses on the influence of feature-space dimension and number of prototypes on the number of distances to be computed.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Enrique Vidal Ruiz. An Algorithm For Finding Nearest Neighbours in (Approximately) Constant Average Time. Pattern Recognition Letters 4, 1986.

    Google Scholar 

  2. Enrique Vidal, Hector M. Rulot, Francisco Casacuberta, Jose Miguel Benedi. On the Use of a Metric-Space Search Algorithm (AESA) for Fast DTW-Based Recognition of Isolated Words. IEEE Trans. Acoust., Speech. Signal Processing, VOL. 36,No. 5, May 1988.

    Google Scholar 

  3. H. Ney. Acoustic Modelling Of Phoneme Units For Continuous Speech Recognition. Signal Processing V: Theory and Applications, 1990.

    Google Scholar 

  4. H. Ney, V. Steinbiss, R. Haeb-Umbach, B.-H. Tran, U. Essen. An Overview of the Philips Research System for Large-Vocabulary Continuous-Speech Recognition. To appear in International Journal of Pattern Recognition and Artificial Intelligence, 1994.

    Google Scholar 

  5. Jon Luis Bentley. Writing Efficient Programs. Prentice Hall International, London, 1982. pp. 67, 68.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1995 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Beyerlein, P. (1995). Experiments on a Fast Mixture Density Likelihood Computation. In: Ayuso, A.J.R., Soler, J.M.L. (eds) Speech Recognition and Coding. NATO ASI Series, vol 147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-57745-1_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-57745-1_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-63344-7

  • Online ISBN: 978-3-642-57745-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics