Speech Recognition Using GFIKS

  • Sakriani Sakti
  • Satoshi Nakamura
  • Konstantin Markov
  • Wolfgang Minker
Chapter
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 42)

In this chapter, we demonstrate how a statistical speech recognition system may incorporate additional knowledge sources by utilizing GFIKS at two levels: the HMM state level and the HMM phonetic-unit level. We also present experimental results of incorporating various knowledge sources, including environmental variability (i.e., background noise information), speaker variability (i.e., accent and gender information), and contextual variability (i.e., wide-phonetic information). These knowledge sources may be incorporated individually, as a single type of knowledge source, or as a combination of different types.

We describe some common considerations for using GFIKS at the HMM state level in Section 4.1 and at the HMM phonetic-unit level in Section 4.2. These include defining causal relationships between information sources, inference, training issues, and enhancing model reliability. Then, in Section 4.3, we describe an experimental evaluation of applying the proposed GFIKS to the task of incorporating various knowledge sources. Finally, in Section 4.4, we summarize the experiments and compare the different levels of incorporation.

Keywords

  • Speech Recognition
  • Knowledge Source
  • Word Error Rate
  • Junction Tree
  • Pronunciation Dictionary
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer Science+Business Media, LLC 2009

Authors and Affiliations

  • Sakriani Sakti (1)
  • Satoshi Nakamura (2)
  • Konstantin Markov (3)
  • Wolfgang Minker (4)
  1. NICT/ATR Spoken Language Communication Research Laboratories, Kyoto, Japan
  2. NICT/ATR Spoken Language Communication Research Laboratories, Kyoto, Japan
  3. NICT/ATR Spoken Language Communication Research Laboratories, Kyoto, Japan
  4. University of Ulm, Ulm, Germany
