Code Breaking for Automatic Speech Recognition

Jelinek, Frederick

doi:10.1007/978-3-642-04208-9_1

Frederick Jelinek

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5729)

Included in the following conference series:

International Conference on Text, Speech and Dialogue

Abstract

Practical automatic speech recognition is of necessity a (near) real-time activity performed by a system whose structure is fixed and whose parameters, once trained, may be adapted on the basis of the speech that the system observes during recognition.

However, in especially important situations (e.g., recovery of out-of-vocabulary words) the recognition task could be viewed as an activity akin to code-breaking, to whose accomplishment an essentially infinite amount of effort can be devoted. In such a case everything would be fair, including, for instance, the retraining of a language and/or acoustic model on the basis of newly acquired data (from the Internet!) or even a complete change of the recognizer paradigm.

An obvious way to proceed is to use the basic recognizer to produce a lattice or confusion network and then do the utmost to eliminate ambiguity. Another possibility is to create a list of frequent confusions (for instance the pair IN and AND) and prepare an appropriate individual decision process to resolve each when it occurs in test data. We will report on our initial code breaking effort.
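The second idea, a list of frequent confusions with one decision process per pair, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the chapter: the confusion network is modelled as a list of slots mapping words to posteriors, `RESOLVERS` and `resolve_in_and` are hypothetical names, and the IN/AND rule is a toy heuristic standing in for whatever specialised classifier one would actually train.

```python
from typing import Callable, Dict, FrozenSet, List

# One confusion-network slot: candidate word -> posterior probability.
# Slots are assumed to be non-empty.
Slot = Dict[str, float]
# A resolver sees the slot plus left and right context and returns one word.
Resolver = Callable[[Slot, List[str], List[str]], str]


def top_words(slot: Slot, n: int = 2) -> List[str]:
    """Return up to n words from the slot, highest posterior first."""
    return sorted(slot, key=slot.get, reverse=True)[:n]


def resolve_in_and(slot: Slot, left: List[str], right: List[str]) -> str:
    """Toy specialist for the IN/AND pair (illustrative rule only)."""
    # Hypothetical heuristic: a following determiner favours the preposition.
    if right and right[0] in {"THE", "A", "AN"}:
        return "IN"
    # Otherwise fall back to the recognizer's own posteriors.
    return max(("IN", "AND"), key=lambda w: slot.get(w, 0.0))


# Hypothetical registry: one decision process per known confusion pair.
RESOLVERS: Dict[FrozenSet[str], Resolver] = {
    frozenset({"IN", "AND"}): resolve_in_and,
}


def decode(network: List[Slot]) -> List[str]:
    """Pick one word per slot, deferring to a specialist for known pairs."""
    # First pass: plain highest-posterior choice in every slot.
    baseline = [top_words(slot, 1)[0] for slot in network]
    output = list(baseline)
    for i, slot in enumerate(network):
        pair = frozenset(top_words(slot, 2))
        resolver = RESOLVERS.get(pair)
        if resolver is not None:
            # Left context uses already-resolved words, right context the baseline.
            output[i] = resolver(slot, output[:i], baseline[i + 1:])
    return output
```

In this sketch a slot is handed to a specialist only when its two strongest candidates form a registered pair; every other slot keeps the recognizer's top choice, so the baseline output is changed only where a dedicated decision process exists.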

Author information

Authors and Affiliations

Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, USA
Frederick Jelinek

Editor information

Editors and Affiliations

University of West Bohemia at Pilsen, Czech Republic
Václav Matoušek
Department of Computer Science, University of West Bohemia in Pilsen, Univerzitni 8, 30614, Plzen, Czech Republic
Pavel Mautner

About this paper

Cite this paper

Jelinek, F. (2009). Code Breaking for Automatic Speech Recognition. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2009. Lecture Notes in Computer Science, vol 5729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04208-9_1

DOI: https://doi.org/10.1007/978-3-642-04208-9_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04207-2
Online ISBN: 978-3-642-04208-9
eBook Packages: Computer Science, Computer Science (R0)
