Abstract
We present a new approach to designing robust speaker verification for noisy environments, based on principles from missing data theory and Bayesian networks. The approach integrates high-level information about the reliability of pitch and spectral envelope features into the missing feature compensation process in order to improve the performance of speaker Gaussian mixture models (GMMs). A Bayesian network models the statistical dependencies between reliable prosodic and spectral envelope features. Within this framework, the conditional statistical distributions of the features (represented by GMMs) are exploited simultaneously to improve the recognition score, particularly in very noisy conditions. Data masked by noise can be discarded, and the Bayesian network can then be used to infer the likelihood values and compute the recognition scores. The system is tested on a challenging text-independent, telephone-quality speaker verification task.
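To make the idea of discarding noise-masked data concrete, the sketch below shows one common form of missing-feature compensation: marginalizing unreliable dimensions out of a GMM likelihood. It is only an illustration under stated assumptions, not the Bayesian network described in the abstract. It assumes diagonal covariances and a binary reliability mask that is already available, and the function name `marginal_gmm_loglik` and all parameter values are hypothetical.

```python
import numpy as np

def marginal_gmm_loglik(x, reliable, weights, means, variances):
    """Log-likelihood of one feature frame under a diagonal-covariance GMM,
    evaluated only on the dimensions flagged as reliable.

    With diagonal covariances, integrating out an unreliable dimension
    simply removes its factor from each component density.
    """
    x = np.asarray(x, dtype=float)
    reliable = np.asarray(reliable, dtype=bool)
    weights = np.asarray(weights, dtype=float)
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)

    xr = x[reliable]                 # reliable features only
    mu = means[:, reliable]          # shape (components, reliable dims)
    var = variances[:, reliable]

    # Per-component log density over the reliable dimensions.
    log_comp = -0.5 * (np.log(2.0 * np.pi * var) + (xr - mu) ** 2 / var).sum(axis=1)

    # Log-sum-exp over components for numerical stability.
    a = np.log(weights) + log_comp
    m = a.max()
    return float(m + np.log(np.exp(a - m).sum()))


# Hypothetical two-component model over three features.
weights = [0.6, 0.4]
means = [[0.0, 1.0, -1.0], [2.0, 0.0, 1.0]]
variances = [[1.0, 0.5, 2.0], [1.5, 1.0, 1.0]]

frame = [0.1, 0.9, 50.0]          # third feature is corrupted by noise
mask = [True, True, False]        # assumed reliability decision

score_masked = marginal_gmm_loglik(frame, mask, weights, means, variances)
score_full = marginal_gmm_loglik(frame, [True, True, True], weights, means, variances)
print(score_masked, score_full)
```

In this sketch the corrupted dimension no longer drags the score down once it is masked, which is the effect the missing-data framework aims for. If the component weights sum to one and no dimension is reliable, the function returns a log-likelihood of zero, so the frame contributes nothing to the accumulated score.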
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Arcienega, M., Drygajlo, A. (2003). A Bayesian Network Approach for Combining Pitch and Reliable Spectral Envelope Features for Robust Speaker Verification. In: Kittler, J., Nixon, M.S. (eds) Audio- and Video-Based Biometric Person Authentication. AVBPA 2003. Lecture Notes in Computer Science, vol 2688. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44887-X_10
DOI: https://doi.org/10.1007/3-540-44887-X_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40302-9
Online ISBN: 978-3-540-44887-7
eBook Packages: Springer Book Archive