Audio Processing and Speech Recognition

Concepts, Techniques and Research Overviews

  • Soumya Sen
  • Anjan Dutta
  • Nilanjan Dey

Part of the SpringerBriefs in Applied Sciences and Technology book series (BRIEFSAPPLSCIENCES)

Also part of the SpringerBriefs in Computational Intelligence book sub series (BRIEFSINTELL)

Table of contents

  1. Front Matter
    Pages i-xiv
  2. Soumya Sen, Anjan Dutta, Nilanjan Dey
    Pages 1-11
  3. Soumya Sen, Anjan Dutta, Nilanjan Dey
    Pages 13-43
  4. Soumya Sen, Anjan Dutta, Nilanjan Dey
    Pages 45-66
  5. Soumya Sen, Anjan Dutta, Nilanjan Dey
    Pages 67-93
  6. Soumya Sen, Anjan Dutta, Nilanjan Dey
    Pages 95-96

About this book


This book offers an overview of audio processing, including the latest advances in the methodologies used in audio processing and speech recognition. First, it discusses the importance of audio indexing and classical information retrieval problem and presents two major indexing techniques, namely Large Vocabulary Continuous Speech Recognition (LVCSR) and Phonetic Search. It then offers brief insights into the human speech production system and its modeling, which are required to produce artificial speech. It also discusses various components of an automatic speech recognition (ASR) system. 
Describing the chronological developments in ASR systems, and briefly examining the statistical models used in ASR as well as the related mathematical deductions, the book summarizes a number of state-of-the-art classification techniques and their application in audio/speech classification. 
By providing insights into various aspects of audio/speech processing and speech recognition, this book appeals a wide audience, from researchers and postgraduate students to those new to the field.


Audio Indexing Classic Information Retrieval problem Large vocabulary continuous speech recognition Phonetic search Automatic speech recognition Hidden Markov Model(HMM) Neural network and speech recognition Feature extraction Linear prediction coding Mel frequency Cepstral Coefficient Linear prediction Cepstral Coefficient Discrete Wavelet Transform (DWT) Wavelet Packet Decomposition (WPD) Perceptual Linear Prediction (PLP) K-nearest neighbors Bayesian classifier Decision Tree Support Vector machines (SVM)

Authors and affiliations

  • Soumya Sen
    • 1
  • Anjan Dutta
    • 2
  • Nilanjan Dey
    • 3
  1. 1.A.K.Choudhury School of Information TechnologyUniversity of CalcuttaKolkataIndia
  2. 2.Department of Information TechnologyTechno India College of TechnologyKolkataIndia
  3. 3.Department of Information TechnologyTechno India College of TechnologyKolkataIndia

Bibliographic information

  • DOI
  • Copyright Information The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2019
  • Publisher Name Springer, Singapore
  • eBook Packages Engineering Engineering (R0)
  • Print ISBN 978-981-13-6097-8
  • Online ISBN 978-981-13-6098-5
  • Series Print ISSN 2191-530X
  • Series Online ISSN 2191-5318
  • Buy this book on publisher's site
Industry Sectors
IT & Software
Oil, Gas & Geosciences