Advertisement

Robust Speaker Identification in a Meeting with Short Audio Segments

  • Giorgio Biagetti
  • Paolo Crippa
  • Laura Falaschetti
  • Simone Orcioni
  • Claudio Turchetti
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 57)

Abstract

The paper proposes a speaker identification scheme for a meeting scenario, that is able to answer the question “is somebody currently talking?”, if yes, “who is it?”. The suggested system has been designed to identify during a meeting conversation the current speaker from a set of pre-trained speaker models. Experimental results on two databases show the robustness of the approach to the overlapping phenomena and the ability of the algorithm to correctly identify a speaker with short audio segments.

Keywords

Speaker identification Meeting conversation Speaker diarization Overlapping speech 

References

  1. 1.
    Biagetti, G., Crippa, P., Curzi, A., Orcioni, S., Turchetti, C.: Speaker identification with short sequences of speech frames. In: Proceedings of the International Conference on Pattern Recognition Applications and Methods, pp. 178–185 (2015)Google Scholar
  2. 2.
    Carletta, J., Ashby, S., Bourban, S., Flynn, M., Guillemot, M., Hain, T., Kadlec, J., Karaiskos, V., Kraaij, W., Kronenthal, M., et al.: The AMI meeting corpus: a pre-announcement. Springer, Berlin (2005)Google Scholar
  3. 3.
    Friedland, G., Vinyals, O.: Live speaker identification in conversations. In: Proceedings of the 16th ACM International Conference on Multimedia, pp. 1017–1018. ACM (2008)Google Scholar
  4. 4.
    Galibert, O.: Methodologies for the evaluation of speaker diarization and automatic speech recognition in the presence of overlapping speech. In: Proceedings of INTERSPEECH, pp. 1131–1134 (2013)Google Scholar
  5. 5.
    Hotelling, H.: Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 24(6), 417–441 (1933)CrossRefGoogle Scholar
  6. 6.
    Jiang, X.: Linear subspace learning-based dimensionality reduction. IEEE Sig. Process. Mag. 28(2), 16–26 (2011)CrossRefGoogle Scholar
  7. 7.
    Jolliffe, I.T.: Principal Component Analysis. Springer Series in Statistics. Springer, New York (1986)CrossRefGoogle Scholar
  8. 8.
    Luque, J., Hernando, J.: Robust speaker identification for meetings: UPC CLEAR’07 meeting room evaluation system. In: Multimodal Technologies for Perception of Humans, pp. 266–275. Springer (2008)Google Scholar
  9. 9.
    NIST: 2000 speaker recognition evaluation—evaluation plan. (2000). http://www.itl.nist.gov/iad/mig/tests/spk/2000/spk-2000-plan-v1.0.htm
  10. 10.
    Reynolds, D.A.: Experimental evaluation of features for robust speaker identification. IEEE Trans. Speech Audio Process. 2(4), 639–643 (1994)MathSciNetCrossRefGoogle Scholar
  11. 11.
    Reynolds, D.A., Rose, R.C.: Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Speech Audio Process. 3(1), 72–83 (1995)CrossRefGoogle Scholar
  12. 12.
    Singh-Miller, N., Collins, M., Hazen, T.J.: Dimensionality reduction for speech recognition using neighborhood components analysis. In: Proceedings of INTERSPEECH, pp. 1158–1161 (2007)Google Scholar
  13. 13.
    Togneri, R., Pullella, D.: An overview of speaker identification: accuracy and robustness issues. IEEE Circ. Syst. Mag. 11(2), 23–61 (2011)CrossRefGoogle Scholar
  14. 14.
    Yella, S.H., Bourlard, H.: Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations. IEEE/ACM Trans. Audio Speech Lang. Process. 22(12), 1688–1700 (2014)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 2.5 International License (http://creativecommons.org/licenses/by-nc/2.5/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Authors and Affiliations

  • Giorgio Biagetti
    • 1
  • Paolo Crippa
    • 1
  • Laura Falaschetti
    • 1
  • Simone Orcioni
    • 1
  • Claudio Turchetti
    • 1
  1. 1.DII – Dipartimento di Ingegneria dell’InformazioneUniversità Politecnica delle MarcheAnconaItaly

Personalised recommendations