Skip to main content

Influence of Noise and Voice Activity Detection on Speaker Verification

  • Conference paper
  • First Online:
Computer Networks (CN 2016)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 608))

Included in the following conference series:

  • 938 Accesses

Abstract

The scope of this paper is to check influence of voice activity detection VAD procedure and its accuracy on speaker verification error rates. It is shown that for speech of high quality, it is absolutely necessary to remove silence from the signal as the errors increase radically. It is better to remove more than less from the signal as the equal error rate EER is the worst for the original speech with silence. Additionally influence of white noise, which was added to speech utterances, was examined. Presented results show that in order to achieve highly reliable speaker verification system it must be insensitive to low quality of speech, since noise is the most important factor responsible for high error rates.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dustor, A.: Voice verification based on nonlinear Ho-Kashyap classifier. In: International Conference on Computational Technologies in Electrical and Electronics Engineering SIBIRCON 2008, pp. 296–300. Novosibirsk (2008)

    Google Scholar 

  2. Dustor, A., Szwarc, P.: Application of GMM models to spoken language recognition. In: Napieralski, A. (ed.) MIXDES 2009: Proceedings of the 16th International Conference Mixed Design of Integrated Circuits and Systems Lodz, Poland, pp. 603–606 (2009)

    Google Scholar 

  3. Dustor, A.: Speaker verification based on fuzzy classifier. In: Cyran, K.A., Kozielski, S., Peters, J.F., Stańczyk, U., Wakulicz-Deja, A. (eds.) Man-Machine Interactions. AISC, vol. 59, pp. 389–397. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  4. Dustor, A., Szwarc, P.: Spoken language identification based on GMM models. In: Pulka, A., Golonek, T. (eds.) Inetrnational Conference on Signals and Electronic Systems (ICSES 2010): Conference Proceedings, Poland, Gliwice, pp. 105–108 (2010)

    Google Scholar 

  5. Dustor, A., Kłosowski, P.: Biometric voice identification based on fuzzy kernel classifier. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2013. CCIS, vol. 370, pp. 456–465. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  6. Kłosowski, P., Dustor, A.: Automatic speech segmentation for automatic speech translation. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2013. CCIS, vol. 370, pp. 466–475. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  7. Dustor, A., Kłosowski, P., Izydorczyk, J.: Influence of feature dimensionality and model complexity on speaker verification performance. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2014. CCIS, vol. 431, pp. 177–186. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  8. Dustor, A., Klosowski, P., Izydorczyk, J.: Speaker recognition system with good generalization properties. In: 2014 International Conference on Multimedia Computing and Systems (ICMCS), Marrakech, Morocco, pp. 206–210 (2014)

    Google Scholar 

  9. Kłosowski, P., Dustor, A., Izydorczyk, J., Kotas, J., Ślimok, J.: Speech recognition based on open source speech processing software. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) CN 2014. CCIS, vol. 431, pp. 308–317. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  10. Dustor, A., Kłosowski, P., Izydorczyk, J., Kopański, R.: Influence of corpus size on speaker verification. In: Gaj, P., Kwiecień, A., Stera, P. (eds.) CN 2015. CCIS, vol. 522, pp. 242–249. Springer, Heidelberg (2015)

    Chapter  Google Scholar 

  11. Kłosowski, P., Dustor, A., Izydorczyk, J.: Speaker verification performance evaluation based on open source speech processing software and TIMIT speech corpus. In: Gaj, P., Kwiecień, A., Stera, P. (eds.) CN 2015. CCIS, vol. 522, pp. 400–409. Springer, Heidelberg (2015)

    Chapter  Google Scholar 

  12. Fazel, A., Chakrabartty, S.: An overview of statistical pattern recognition techniques for speaker verification. IEEE Circuits Syst. Mag. 11(2), 62–81 (2011)

    Article  Google Scholar 

  13. Adamczyk, B., Adamczyk, K., Trawiński, K.: Zasób mowy ROBOT. Biuletyn Instytutu Automatyki i Robotyki WAT 12, 179–192 (2000)

    Google Scholar 

Download references

Acknowledgment

This work was supported by the Ministry of Science and Higher Education funding for statutory activities.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adam Dustor .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Dustor, A. (2016). Influence of Noise and Voice Activity Detection on Speaker Verification. In: Gaj, P., Kwiecień, A., Stera, P. (eds) Computer Networks. CN 2016. Communications in Computer and Information Science, vol 608. Springer, Cham. https://doi.org/10.1007/978-3-319-39207-3_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-39207-3_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-39206-6

  • Online ISBN: 978-3-319-39207-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics