The CLEAR’06 LIMSI Acoustic Speaker Identification System for CHIL Seminars

Barras, Claude; Zhu, Xuan; Gauvain, Jean-Luc; Lamel, Lori

doi:10.1007/978-3-540-69568-4_20

Claude Barras¹,
Xuan Zhu¹,
Jean-Luc Gauvain¹ &
…
Lori Lamel¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4122))

Included in the following conference series:

International Evaluation Workshop on Classification of Events, Activities and Relationships

1261 Accesses
3 Citations

Abstract

This paper summarizes the LIMSI participation in the CLEAR’06 acoustic speaker identification task that aims to identify speakers in CHIL seminars via the acoustic channel. The system consists of a standard Gaussian mixture model based system similar to systems developed for the NIST speaker recognition evaluations and includies feature warping of cepstral coefficients and MAP adaptation of a Universal Background Model. Several computational optimizations were implemented for real-time efficiency: stochastic frame subsampling for training, top-Gaussians scoring and auto-adaptive pruning for the tests, speeding up the system by more than a factor of ten.

This work was partially financed by the European Commission under the FP6 Integrated Project IP 506909 Chil.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Barras, C., Gauvain, J.-L.: Feature and score normalization for speaker verification of cellular data. In: Proc. of IEEE ICASSP (May 2003)
Google Scholar
Doddington, G., Przybocki, M., Martin, A., Reynolds, D.: The NIST speaker recognition evaluation - overview, methodology, systems, results, perspective. Speech Communication 31, 225–254 (2000)
Article Google Scholar
Gauvain, J.-L., Lee, C.H.: Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Transactions on Speech and Audio Processing 2(2), 291–298 (1994)
Article Google Scholar
McLaughlin, J., Reynolds, D., Gleason, T.: A Study of Computation Speed-UPS of the GMM-UBM Speaker Recognition System. In: Proc. Eurospeech’99, Budapest, pp. 1215–1218 (Sept. 1999)
Google Scholar
Mostefa, D., et al.: CLEAR Evaluation Plan v1.1 (2006), http://isl.ira.uka.de/clear06/downloads/chil-clear-v1.1-2006-02-21.pdf
Pelecanos, J., Sridharan, S.: Feature warping for robust speaker verification. In: Proc. ISCA Workshop on Speaker Recognition - Odyssey (June 2001)
Google Scholar
Reynolds, D., Quatieri, T., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10, 19–41 (2000)
Article Google Scholar
Zhu, X., Leung, C-C., Barras, C., Lamel, L., Gauvain, J-L.: Speech activity detection and speaker identification for CHIL. In: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), Edinburgh (July 2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Spoken Language Processing Group, LIMSI-CNRS, BP 133, 91403 Orsay cedex, France
Claude Barras, Xuan Zhu, Jean-Luc Gauvain & Lori Lamel

Authors

Claude Barras
View author publications
You can also search for this author in PubMed Google Scholar
Xuan Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Luc Gauvain
View author publications
You can also search for this author in PubMed Google Scholar
Lori Lamel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Rainer Stiefelhagen John Garofolo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Barras, C., Zhu, X., Gauvain, JL., Lamel, L. (2007). The CLEAR’06 LIMSI Acoustic Speaker Identification System for CHIL Seminars. In: Stiefelhagen, R., Garofolo, J. (eds) Multimodal Technologies for Perception of Humans. CLEAR 2006. Lecture Notes in Computer Science, vol 4122. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69568-4_20

Download citation

DOI: https://doi.org/10.1007/978-3-540-69568-4_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69567-7
Online ISBN: 978-3-540-69568-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics