Abstract
This paper presents an application of one method for improving fundamental frequency detection from a speech. The method is based on searching the best pitch paths over one or more words. It uses the idea that the fundamental frequency of a speaker cannot change sharply in a short time so that the pitch should not vary rapidly over one (or a few) words. This technique is created for improving pitch detection. It cannot detect the pitch itself, but it uses some pitch detectors. We compare some of them here and we try to determine which is the most suitable one for our method.
This work is supported by the Ministry of Education, Youth and Sports of the Czech Republic-project No. VS97060, and by the Research programme of Brno University of Technology “Research of electronic communication systems and technologies”, No. CEZ:J22/98:262200011.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
B. Gold and N. Morgan. Speech and Audio Signal Processing, pages 214–227 and 415-430, New York, 1999.
D. Talkin. A Robust Algorithm for Pitch Tracking (RAPT). In Kleijn, W. B. and Paliwal, K. K. (Eds.), Speech Coding and Synthesis. New York: Elseviever, 1995.
B. P. Bogart, M. J. R. Healy, and J. W. Tukey, The frequency analysis of time series for echoes: Cepstrum, pseudo-autocovariance, cross-cepstrum and shape tracking, in Symphosium on Time Series Analysis (M. Rosenblatt, Ed.), (New York), pp. 209–243, John Wiley and Sons, 1963.
L. R. Rabiner, On the use of autocorrelation analysis for pitch detection, IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-25, pp. 24–33, February 1977.
J. Černocký. Speech Processing Using Automatically Derived Segmental Units, Ph.D. Thesis, ESIEE, France, 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Motlíček, P., Černocký, J. (2000). Optimal Pitch Path Tracking for More Reliable Pitch Detection. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science(), vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_31
Download citation
DOI: https://doi.org/10.1007/3-540-45323-7_31
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41042-3
Online ISBN: 978-3-540-45323-9
eBook Packages: Springer Book Archive