Abstract
In this paper, we report on the application of Bayesian methods to the analysis of speech signals. Voiced speech can be modelled as a superposition of decaying sinusoids and estimates of the resonant frequencies, decay rates, phases, amplitudes as well as the number of model functions are calculated. The motivation for this model is that in speech analysis, the frequencies and decay rates correspond to formants and bandwidths which are perceptually significant parameters. Speech parameters are estimated by calculating the posterior probabilities for the model parameters, after various nuisance parameters have been marginalised, and it is shown how model order evidence can be calculated. Comparisons with methods such as Minimum Description Length and the Akaike Information Criteria will be made.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Akaike, H., Information theory and the extension of the maximum likelihood principle, 2nd Inter.Symp. on Information Theory (Petrov, B.N.and Csaki, F. eds.), Akademiai 281, 1973.
Bretthorst, L., Bayesian Spectrum Analysis and Parameter Estimation, Lecture notes in statistics, Springer Verlag 1989.
Broemeling, L.D., Bayesian Analysis of Linear Models, Marcel Dekker, Inc, 1985.
Cox, D.R., and Reid, N., The Canadian Journal of Statistics, 17, 229, 1989.
Duijndam, A.J.W., Geophysical Prospecting, 36, 878, 1988.
Naylor, J.C. and Shaw, J.E.H., Bayes Four User Manual, Nottingham Polytechnic, 1990.
Rissanen, J., Automatica, 14, pp 465–471, 1978.
Schwarz, G., The Annals of Statistics, 6, 461, 1978.
West, M., and Harrison, J., Bayesian Forcasting and Dynamic Models, Springer Verlag, 1989.
Zellner, A., An Introduction to Bayesian Inference in Econometrics, New York, John Wiley and Sons, 1971.
Fitzgerald, W.J., Bayesian Data Analysis, Proc. I.O.A. Vol 13 part 9 pp 212–219, 1991.
Lasenby, J., and Fitzgerald, W.J., A Bayesian Approach to High-Resolution Beam-forming, IEE Proc. F, Vol 138, Number 6, pp 539–544, 1991.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1993 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Fitzgerald, W.J., Niranjan, M. (1993). Speech Processing Using Bayesian Inference. In: Mohammad-Djafari, A., Demoment, G. (eds) Maximum Entropy and Bayesian Methods. Fundamental Theories of Physics, vol 53. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-2217-9_27
Download citation
DOI: https://doi.org/10.1007/978-94-017-2217-9_27
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-4272-9
Online ISBN: 978-94-017-2217-9
eBook Packages: Springer Book Archive