Abstract
This paper summarizes the LIMSI participation in the CLEAR’06 acoustic speaker identification task that aims to identify speakers in CHIL seminars via the acoustic channel. The system consists of a standard Gaussian mixture model based system similar to systems developed for the NIST speaker recognition evaluations and includies feature warping of cepstral coefficients and MAP adaptation of a Universal Background Model. Several computational optimizations were implemented for real-time efficiency: stochastic frame subsampling for training, top-Gaussians scoring and auto-adaptive pruning for the tests, speeding up the system by more than a factor of ten.
This work was partially financed by the European Commission under the FP6 Integrated Project IP 506909 Chil.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barras, C., Gauvain, J.-L.: Feature and score normalization for speaker verification of cellular data. In: Proc. of IEEE ICASSP (May 2003)
Doddington, G., Przybocki, M., Martin, A., Reynolds, D.: The NIST speaker recognition evaluation - overview, methodology, systems, results, perspective. Speech Communication 31, 225–254 (2000)
Gauvain, J.-L., Lee, C.H.: Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Transactions on Speech and Audio Processing 2(2), 291–298 (1994)
McLaughlin, J., Reynolds, D., Gleason, T.: A Study of Computation Speed-UPS of the GMM-UBM Speaker Recognition System. In: Proc. Eurospeech’99, Budapest, pp. 1215–1218 (Sept. 1999)
Mostefa, D., et al.: CLEAR Evaluation Plan v1.1 (2006), http://isl.ira.uka.de/clear06/downloads/chil-clear-v1.1-2006-02-21.pdf
Pelecanos, J., Sridharan, S.: Feature warping for robust speaker verification. In: Proc. ISCA Workshop on Speaker Recognition - Odyssey (June 2001)
Reynolds, D., Quatieri, T., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10, 19–41 (2000)
Zhu, X., Leung, C-C., Barras, C., Lamel, L., Gauvain, J-L.: Speech activity detection and speaker identification for CHIL. In: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), Edinburgh (July 2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Barras, C., Zhu, X., Gauvain, JL., Lamel, L. (2007). The CLEAR’06 LIMSI Acoustic Speaker Identification System for CHIL Seminars. In: Stiefelhagen, R., Garofolo, J. (eds) Multimodal Technologies for Perception of Humans. CLEAR 2006. Lecture Notes in Computer Science, vol 4122. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69568-4_20
Download citation
DOI: https://doi.org/10.1007/978-3-540-69568-4_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69567-7
Online ISBN: 978-3-540-69568-4
eBook Packages: Computer ScienceComputer Science (R0)