Voiced/Unvoiced Decision for Speech Signals Based on Zero-Crossing Rate and Energy

Bachu, R.G.; Kopparthi, S.; Adapa, B.; Barkana, B.D.

doi:10.1007/978-90-481-3660-5_47

R.G. Bachu²,
S. Kopparthi²,
B. Adapa² &
…
B.D. Barkana²

3114 Accesses
61 Citations

Abstract

In speech analysis, the voiced-unvoiced decision is usually performed in extracting the information from the speech signals. In this paper, two methods are performed to separate the voiced and unvoiced parts of the speech signals. These are zero crossing rate (ZCR) and energy. In here, we evaluated the results by dividing the speech sample into some segments and used the zero crossing rate and energy calculations to separate the voiced and unvoiced parts of speech. The results suggest that zero crossing rates are low for voiced part and high for unvoiced part where as the energy is high for voiced part and low for unvoiced part. Therefore, these methods are proved effective in separation of voiced and unvoiced speech.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

J. K. Lee, C. D. Yoo, “Wavelet speech enhancement based on voiced/unvoiced decision”, Korea Advanced Institute of Science and Technology The 32nd International Congress and Exposition on Noise Control Engineering, Jeju International Convention Center, Seogwipo, Korea, August 25-28, 2003.
Google Scholar
B. Atal, and L. Rabiner, “A Pattern Recognition Approach to Voiced-Unvoiced-Silence Classification with Applications to Speech Recognition,” IEEE Trans. On ASSP, vol. ASSP-24, pp. 201-212, 1976.
Article Google Scholar
S. Ahmadi, and A.S. Spanias, “Cepstrum-Based Pitch Detection using a New Statistical V/UV Classification Algorithm,” IEEE Trans. Speech Audio Processing, vol. 7 No. 3, pp. 333-338, 1999.
Article Google Scholar
Y. Qi, and B.R. Hunt, “Voiced-Unvoiced-Silence Classifications of Speech using Hybrid Features and a Network Classifier,” IEEE Trans. Speech Audio Processing, vol. 1 No. 2, pp. 250-255, 1993.
Article Google Scholar
L. Siegel, “A Procedure for using Pattern Classification Techniques to obtain a Voiced/Unvoiced Classifier”, IEEE Trans. on ASSP, vol. ASSP-27, pp. 83- 88, 1979.
Article Google Scholar
T.L. Burrows, “Speech Processing with Linear and Neural Network Models”, Ph.D. thesis, Cambridge University Engineering Department, U.K., 1996.
Google Scholar
D.G. Childers, M. Hahn, and J.N. Larar, “Silent and Voiced/Unvoiced/Mixed Excitation (Four-Way) Classification of Speech,” IEEE Trans. on ASSP, vol. 37 No. 11, pp. 1771-1774, 1989.
Article Google Scholar
J. K. Shah, A. N. Iyer, B. Y. Smolenski, and R. E. Yantorno “Robust voiced/unvoiced classification using novel features and Gaussian Mixture model”, Speech Processing Lab., ECE Dept., Temple University, 1947 N 12^th St., Philadelphia, PA 19122-6077, USA.
Google Scholar
J. Marvan, “Voice Activity detection Method and Apparatus for voiced/unvoiced decision and Pitch Estimation in a Noisy speech feature extraction”, 08/23/2007, United States Patent 20070198251.
Google Scholar
T. F. Quatieri, Discrete-Time Speech Signal Processing: Principles and Practice, MIT Lincoln Laboratory, Lexington, Massachusetts, Prentice Hall, 2001, ISBN-13:9780132429429.
Google Scholar
F.J. Owens, Signal Processing of Speech, McGraw-Hill, Inc., 1993, ISBN-0-07-0479555-0.
Google Scholar
L. R. Rabiner, and R. W. Schafer, Digital Processing of Speech Signals, Englewood Cliffs, New Jersey, Prentice Hall, 512-ISBN-13:9780132136037, 1978.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, School of Engineering, University of Bridgeport, 221 University Ave., Bridgeport, CT, 06604, USA
R.G. Bachu, S. Kopparthi, B. Adapa & B.D. Barkana

Authors

R.G. Bachu
View author publications
You can also search for this author in PubMed Google Scholar
S. Kopparthi
View author publications
You can also search for this author in PubMed Google Scholar
B. Adapa
View author publications
You can also search for this author in PubMed Google Scholar
B.D. Barkana
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Engineering, University of Bridgeport, University Avenue 221, Bridgeport, 06604, U.S.A.
Khaled Elleithy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bachu, R., Kopparthi, S., Adapa, B., Barkana, B. (2010). Voiced/Unvoiced Decision for Speech Signals Based on Zero-Crossing Rate and Energy. In: Elleithy, K. (eds) Advanced Techniques in Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-3660-5_47

Download citation

DOI: https://doi.org/10.1007/978-90-481-3660-5_47
Published: 15 December 2009
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-3659-9
Online ISBN: 978-90-481-3660-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics