Basic Speech Processing Concepts

  • Priyabrata Sinha


Before we explore the algorithms and techniques used to process speech signals to accomplish various objectives in an embedded application, we need to understand some fundamental principles behind the nature of speech signals. Of particular importance are the temporal and spectral characteristics of different types of vocal sounds produced by humans and what role the human speech production system itself plays in determining the properties of these sounds. This knowledge enables us to efficiently model the sounds generated, thereby providing the foundation of sophisticated techniques for compressing speech. Moreover, any spoken language is based on a combination and sequence of such sounds; hence understanding their salient features is useful for the design and implementation of effective speech recognition and synthesis techniques. In this section, we will learn how to classify the basic types of sounds generated by human voice and the underlying time-domain and frequency-domain characteristics behind these different types of sounds. Finally, and most importantly, we will explore some popular speech processing building-block techniques that enable us to extract critical pieces of information from the speech signal, such as which category a speech segment belongs to, the pitch of the sound, and the energy contained therein.


Vocal Cord Speech Signal Vocal Tract Speech Quality Speech Segment 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Rabiner LR, Schafer RW Digital processing of speech signals, Prentice Hall, 1998.Google Scholar
  2. 2.
    Chau WC Speech coding algorithms, Wiley-Interscience, 2003.Google Scholar
  3. 3.
    Holmes J, Holmes W Speech synthesis and recognition, CRC Press, 2001.Google Scholar
  4. 4.
    Proakis JG, Manolakis DG Digital Signal Processing – Principles, Algorithms and Applications, Prentice Hall, 1995.Google Scholar
  5. 5.
    Flanagan JL, Speech Analysis and Perception, Springer Science+Business Media B.V., 1965.Google Scholar
  6. 6.
    Rubin P, Vatikiotis-Bateson E Measuring and Modeling Speech Production, Animal Acoustic Communication, Springer Science+Business Media B.V., 1998.Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2010

Authors and Affiliations

  1. 1.Microchip Technology, Inc.ChandlerUSA

Personalised recommendations