Abstract
This paper describes a new technique for finding objects in spectrograms, and illustrates the idea with an application to the formant-tracking task. Starting with a multiscale representation of speech spectra, a probabilistic relaxation labelling algorithm is applied to determine primitive interpretations of the spectral components. Finally, a cross-scale integration procedure enables the scale space to be collapsed in a principled manner. The techniques are illustrated with an example of voiced speech.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
P D Green, M P Cooke, H H Lafferty & A J H Simons: A Speech Recognition Strategy Based on Making Acoustic Evidence and Phonetic Knowledge Explicit, these proceedings (1987).
H C Leung, V W Zue: Visual Characterisation of Speech Spectrograms. Proc. ICASSP, paper 51.1, (1986).
D Marr: Vision, W H Freeman & Co, (1982).
A P Witkin: Scale-Space filtering. Proc. IJCAI, (1983), 1019–1022
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1988 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cooke, M.P., Green, P.D. (1988). On Finding Objects in Spectrograms: A Multiscale Relaxation Labelling Approach. In: Niemann, H., Lang, M., Sagerer, G. (eds) Recent Advances in Speech Understanding and Dialog Systems. NATO ASI Series, vol 46. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-83476-9_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-83476-9_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-83478-3
Online ISBN: 978-3-642-83476-9
eBook Packages: Springer Book Archive