Basic Speech Processing Concepts

Sinha, Priyabrata

doi:10.1007/978-0-387-75581-6_3

Priyabrata Sinha²

1319 Accesses

Abstract

Before we explore the algorithms and techniques used to process speech signals to accomplish various objectives in an embedded application, we need to understand some fundamental principles behind the nature of speech signals. Of particular importance are the temporal and spectral characteristics of different types of vocal sounds produced by humans and what role the human speech production system itself plays in determining the properties of these sounds. This knowledge enables us to efficiently model the sounds generated, thereby providing the foundation of sophisticated techniques for compressing speech. Moreover, any spoken language is based on a combination and sequence of such sounds; hence understanding their salient features is useful for the design and implementation of effective speech recognition and synthesis techniques. In this section, we will learn how to classify the basic types of sounds generated by human voice and the underlying time-domain and frequency-domain characteristics behind these different types of sounds. Finally, and most importantly, we will explore some popular speech processing building-block techniques that enable us to extract critical pieces of information from the speech signal, such as which category a speech segment belongs to, the pitch of the sound, and the energy contained therein.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Rabiner LR, Schafer RW Digital processing of speech signals, Prentice Hall, 1998.
Google Scholar
Chau WC Speech coding algorithms, Wiley-Interscience, 2003.
Google Scholar
Holmes J, Holmes W Speech synthesis and recognition, CRC Press, 2001.
Google Scholar
Proakis JG, Manolakis DG Digital Signal Processing – Principles, Algorithms and Applications, Prentice Hall, 1995.
Google Scholar
Flanagan JL, Speech Analysis and Perception, Springer Science+Business Media B.V., 1965.
Google Scholar
Rubin P, Vatikiotis-Bateson E Measuring and Modeling Speech Production, Animal Acoustic Communication, Springer Science+Business Media B.V., 1998.
Google Scholar

Download references

Author information

Authors and Affiliations

Microchip Technology, Inc., Chandler, AZ, USA
Priyabrata Sinha

Authors

Priyabrata Sinha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Priyabrata Sinha .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Sinha, P. (2010). Basic Speech Processing Concepts. In: Speech Processing in Embedded Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-75581-6_3

Download citation

DOI: https://doi.org/10.1007/978-0-387-75581-6_3
Published: 10 November 2009
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-75580-9
Online ISBN: 978-0-387-75581-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics