Partial Classification in Speech Recognition Verification

  • Gustavo Hernández Ábrego
  • Israel Torres Sánchez
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2085)


Because speech recognition is imperfect, recognition results need to be verified before they are used in real-life applications. Here we present two perspectives on recognition verification: direct classification and partial classification based on confidence measures. Linear classifiers, decision trees and perceptrons are used as direct classifiers. Confidence measures, on the other hand, are computed through several methods, with MLPs and evolutionary fuzzy systems performing best. Experiments with three types of speech input reveal that higher correct verification rates can be achieved when verification is based on confidence measures. Moreover, classification rates can be improved when verification does not have to deal with “uncertain” examples, which are left unclassified. Partial classification thus represents a trade-off between verification accuracy and the number of recognition results verified.
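
The trade-off described above can be illustrated with a minimal sketch. It assumes each recognition result already carries a confidence score in [0, 1]; how the paper's MLPs or evolutionary fuzzy systems produce that score is not shown. The two thresholds `low` and `high`, the function names, and the toy data are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from typing import List, Optional, Tuple


def partial_classify(confidence: float, low: float, high: float) -> Optional[bool]:
    """Accept (True), reject (False), or leave unclassified (None).

    Assumption: scores between the two thresholds are the "uncertain" examples.
    """
    if confidence >= high:
        return True
    if confidence <= low:
        return False
    return None


def verification_stats(
    scores: List[float], is_correct: List[bool], low: float, high: float
) -> Tuple[float, float]:
    """Return (coverage, accuracy measured on the classified examples only)."""
    if not scores:
        raise ValueError("scores must not be empty")
    decisions = [partial_classify(s, low, high) for s in scores]
    classified = [(d, y) for d, y in zip(decisions, is_correct) if d is not None]
    coverage = len(classified) / len(scores)
    if not classified:
        return coverage, 0.0
    accuracy = sum(d == y for d, y in classified) / len(classified)
    return coverage, accuracy


# Made-up toy data: confidence scores and whether each recognition was correct.
scores = [0.95, 0.80, 0.55, 0.45, 0.30, 0.10]
is_correct = [True, True, False, True, False, False]

# A single threshold classifies every example (full classification).
full = verification_stats(scores, is_correct, low=0.5, high=0.5)
# Two thresholds leave the middle band unclassified (partial classification).
partial = verification_stats(scores, is_correct, low=0.35, high=0.75)

print("full    coverage=%.2f accuracy=%.2f" % full)
print("partial coverage=%.2f accuracy=%.2f" % partial)
```

On this toy data, widening the uncertain band raises accuracy on the verified results while lowering the fraction of results that get verified, which is the trade-off the abstract refers to.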


False Alarm; Speech Recognition; Recognition Result; Fuzzy Logic System; Continuous Speech
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Gustavo Hernández Ábrego
    • 1
  • Israel Torres Sánchez
    • 2
  1. Spoken Language Technology, Sony U.S. Research Labs, San Jose, USA
  2. Signal Theory and Communications Department, Universitat Politècnica de Catalunya, Barcelona, Spain
