Vowel onset point detection for noisy speech using spectral energy at formant frequencies

Vuppala, Anil Kumar; Rao, K. Sreenivasa

doi:10.1007/s10772-012-9179-8

Vowel onset point detection for noisy speech using spectral energy at formant frequencies

Published: 23 October 2012

Volume 16, pages 229–235, (2013)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

Anil Kumar Vuppala¹ &
K. Sreenivasa Rao²

668 Accesses
30 Citations
Explore all metrics

Abstract

In this paper, we propose a method for robust detection of the vowel onset points (VOPs) from noisy speech. The proposed VOP detection method exploits the spectral energy at formant frequencies of the speech segments present in glottal closure region. In this work, formants are extracted by using group delay function, and glottal closure instants are extracted by using zero frequency filter based method. Performance of the proposed VOP detection method is compared with the existing method, which uses the combination of evidence from excitation source, spectral peaks energy and modulation spectrum. Speech data from TIMIT database and noise samples from NOISEX database are used for analyzing the performance of the VOP detection methods. Significant improvement in the performance of VOP detection is observed by using proposed method compared to existing method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Application of Zero-Frequency Filtering for Vowel Onset Point Detection

Robust analysis for improvement of vowel onset point detection under noisy conditions

Article 04 March 2016

Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points

Article 01 August 2017

References

Gangashetty, S. V., Sekhar, C. C., & Yegnanarayana, B. (2004a). Detection of vowel onset points in continuous speech using autoassociative neural network models. In Proc. int. conf. spoken language processing (pp. 401–410).
Google Scholar
Gangashetty, S. V., Sekhar, C. C., & Yegnanarayana, B. (2004b). Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances. In Proc. of IEEE ICISIP (pp. 159–164).
Google Scholar
Hermes, D. J. (1990). Vowel onset detection. The Journal of the Acoustical Society of America, 87, 866–873.
Article Google Scholar
Joseph, M. A., Guruprasad, S., & Yegnanarayana, B. (2006). Extracting formants from short segments of speech using group delay functions. In Proc. of interspeech (pp. 1009–1012).
Google Scholar
Murty, K. S. R., & Yegnanarayana, B. (2008). Epoch extraction from speech signals. IEEE Transactions on Audio, Speech, and Language Processing, 16(8), 1602–1613.
Article Google Scholar
Prasanna, S. R. M., & Yegnanarayana, B. (2005). Detection of vowel onset point events using excitation source information. In Proc. of interspeech (pp. 1133–1136).
Google Scholar
Prasanna, S. R. M., Gangashetty, S. V., & Yegnanarayana, B. (2001). Significance of vowel onset point for speech analysis. In Proc. of int. conf. signal processing and communications (pp. 81–88).
Google Scholar
Prasanna, S. R. M., Reddy, B. V. S., & Krishnamoorthy, P. (2009). Vowel onset point detection using source, spectral peaks, and modulation spectrum energies. IEEE Transactions on Audio, Speech, and Language Processing, 17(4), 556–565.
Article Google Scholar
Rao, K. S., & Yegnanarayana, B. (2009). Duration modification using glottal closure instants and vowel onset points. Speech Communication, 51, 1263–1269.
Article Google Scholar
Vuppala, A. K., Rao, K. S., Chakrabarti, S., Krishnamoorthy, P., & Prasanna, S. R. M. (2011). Recognition of consonant-vowel (cv) units under background noise using combined temporal and spectral preprocessing. International Journal of Speech Technology, 14(1).
Vuppala, A. K., Rao, K. S., & Chakrabarti, S. (2012a). Improved consonant–vowel recognition for low bit-rate coded speech. Wiley International Journal of Adaptive Control and Signal Processing, 26(4), 333–349.
Article Google Scholar
Vuppala, A. K., Rao, K. S., & Chakrabarti, S. (2012b). Spotting and recognition of consonant-vowel units from continuous speech using accurate vowel onset points. Circuits, Systems, and Signal Processing, 31(4), 1459–1474.
Article Google Scholar
Wang, J.-H., & Chen, S.-H. (1999). A c/v segmentation algorithm for mandarin speech using wavelet transforms. In Proc. IEEE int. conf. acoust., speech, signal processing (pp. 1261–1264).
Google Scholar
Wang, J.-F., Wu, C. H., Chang, S. H., & Lee, J. Y. (1991). A hierarchical neural network based C/V segmentation algorithm for mandarin speech recognition. IEEE Transactions on Signal Processing, 39(9), 2141–2146.
Article Google Scholar

Download references

Author information

Authors and Affiliations

LTRC, International Institute of Information Technology-Hyderabad, Hyderabad, India
Anil Kumar Vuppala
School of Information Technology, Indian Institute of Technology, Kharagpur, India
K. Sreenivasa Rao

Authors

Anil Kumar Vuppala
View author publications
You can also search for this author in PubMed Google Scholar
K. Sreenivasa Rao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anil Kumar Vuppala.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Vuppala, A.K., Rao, K.S. Vowel onset point detection for noisy speech using spectral energy at formant frequencies. Int J Speech Technol 16, 229–235 (2013). https://doi.org/10.1007/s10772-012-9179-8

Download citation

Received: 24 June 2012
Accepted: 11 October 2012
Published: 23 October 2012
Issue Date: June 2013
DOI: https://doi.org/10.1007/s10772-012-9179-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Vowel onset point detection for noisy speech using spectral energy at formant frequencies

Abstract

Access this article

Similar content being viewed by others

Application of Zero-Frequency Filtering for Vowel Onset Point Detection

Robust analysis for improvement of vowel onset point detection under noisy conditions

Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Vowel onset point detection for noisy speech using spectral energy at formant frequencies

Abstract

Access this article

Similar content being viewed by others

Application of Zero-Frequency Filtering for Vowel Onset Point Detection

Robust analysis for improvement of vowel onset point detection under noisy conditions

Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation