Advertisement

Incorporating Knowledge Sources into Statistical Speech Recognition

  • Wolfgang Minker
  • Satoshi Nakamura
  • Konstantin Markov
  • Sakriani Sakti

Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 42)

Table of contents

  1. Front Matter
    Pages 1-20
  2. Sakriani Sakti, Satoshi Nakamura, Konstantin Markov, Wolfgang Minker
    Pages 1-17
  3. Sakriani Sakti, Satoshi Nakamura, Konstantin Markov, Wolfgang Minker
    Pages 1-35
  4. Sakriani Sakti, Satoshi Nakamura, Konstantin Markov, Wolfgang Minker
    Pages 1-23
  5. Sakriani Sakti, Satoshi Nakamura, Konstantin Markov, Wolfgang Minker
    Pages 1-59
  6. Sakriani Sakti, Satoshi Nakamura, Konstantin Markov, Wolfgang Minker
    Pages 1-5
  7. Back Matter
    Pages 1-47

About this book

Introduction

Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible.

The authors provide an efficient general framework for incorporating knowledge sources into state-of-the-art statistical ASR systems. This framework, which is called GFIKS (graphical framework to incorporate additional knowledge sources), was designed by utilizing the concept of the Bayesian network (BN) framework. This framework allows probabilistic relationships among different information sources to be learned, various kinds of knowledge sources to be incorporated, and a probabilistic function of the model to be formulated.

Incorporating Knowledge Sources into Statistical Speech Recognition demonstrates how the statistical speech recognition system may incorporate additional information sources by utilizing GFIKS at different levels of ASR. The incorporation of various knowledge sources, including background noises, accent, gender and wide phonetic knowledge information, in modeling is discussed theoretically and analyzed experimentally.

Keywords

Acoustic Modeling Automatic Speech Recognition Bayesian Network Junction Tree Knowledge Source Incoporation Wide-Context Dependency cognition proving speech recognition

Authors and affiliations

  • Wolfgang Minker
  • Satoshi Nakamura
  • Konstantin Markov
  • Sakriani Sakti

There are no affiliations available

Bibliographic information

  • DOI https://doi.org/10.1007/978-0-387-85830-2
  • Copyright Information Springer Science+Business Media, LLC 2009
  • Publisher Name Springer, Boston, MA
  • eBook Packages Engineering
  • Print ISBN 978-0-387-85829-6
  • Online ISBN 978-0-387-85830-2
  • Series Print ISSN 1876-1100
  • Series Online ISSN 1876-1119
  • Buy this book on publisher's site
Industry Sectors
Pharma
Automotive
Biotechnology
Finance, Business & Banking
Electronics
IT & Software
Telecommunications
Consumer Packaged Goods
Aerospace
Oil, Gas & Geosciences
Engineering