Influence of Noise and Voice Activity Detection on Speaker Verification

Dustor, Adam

doi:10.1007/978-3-319-39207-3_18

Part of the book series: Communications in Computer and Information Science (CCIS, volume 608)

Included in the following conference series:

International Conference on Computer Networks

Abstract

This paper examines how the voice activity detection (VAD) procedure and its accuracy influence speaker verification error rates. It is shown that for high-quality speech it is absolutely necessary to remove silence from the signal, as the errors otherwise increase radically. It is better to remove too much than too little from the signal, as the equal error rate (EER) is worst for the original speech with silence. Additionally, the influence of white noise added to the speech utterances was examined. The presented results show that a highly reliable speaker verification system must be insensitive to low speech quality, since noise is the most important factor responsible for high error rates.
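
The three quantities named in the abstract (noise added at a given level, silence removal, and the equal error rate) can be illustrated with a minimal Python sketch. It is not the author's implementation: the abstract does not specify the VAD algorithm, the noise levels, or the scoring method, so the energy-threshold VAD, the function names, and all parameter values (`snr_db`, `frame_len`, `threshold_db`) are assumptions chosen for illustration only.

```python
import numpy as np


def add_white_noise(signal, snr_db, rng=None):
    """Add white Gaussian noise scaled to a target SNR in dB (assumed setup)."""
    rng = np.random.default_rng() if rng is None else rng
    signal = np.asarray(signal, dtype=float)
    signal_power = np.mean(signal ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise


def energy_vad(signal, frame_len=256, threshold_db=-30.0):
    """Hypothetical energy-based VAD: keep frames whose energy lies within
    threshold_db of the loudest frame. Raising the threshold removes more."""
    signal = np.asarray(signal, dtype=float)
    n_frames = len(signal) // frame_len
    frames = signal[:n_frames * frame_len].reshape(n_frames, frame_len)
    energy = np.mean(frames ** 2, axis=1)
    # Energy relative to the loudest frame; small constants avoid log(0).
    energy_db = 10.0 * np.log10(energy / (np.max(energy) + 1e-12) + 1e-12)
    keep = energy_db > threshold_db
    return frames[keep].reshape(-1), keep


def equal_error_rate(genuine_scores, impostor_scores):
    """Approximate the EER by sweeping the decision threshold over all scores.
    A trial is accepted when its score is >= the threshold."""
    genuine = np.asarray(genuine_scores, dtype=float)
    impostor = np.asarray(impostor_scores, dtype=float)
    thresholds = np.sort(np.concatenate([genuine, impostor]))
    far = np.array([np.mean(impostor >= t) for t in thresholds])  # false accepts
    frr = np.array([np.mean(genuine < t) for t in thresholds])    # false rejects
    idx = np.argmin(np.abs(far - frr))
    return (far[idx] + frr[idx]) / 2.0
```

In an experiment of this kind, one would typically apply `add_white_noise` and `energy_vad` to each utterance before feature extraction, score the genuine and impostor trials, and compare `equal_error_rate` across VAD thresholds and noise levels.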

Acknowledgment

This work was supported by the Ministry of Science and Higher Education funding for statutory activities.

Author information

Authors and Affiliations

Institute of Electronics, Silesian University of Technology, Akademicka 16, 44-100, Gliwice, Poland
Adam Dustor

Corresponding author

Correspondence to Adam Dustor.

Editor information

Editors and Affiliations

Silesian University of Technology, Gliwice, Poland
Piotr Gaj
Silesian University of Technology, Gliwice, Poland
Andrzej Kwiecień
Silesian University of Technology, Gliwice, Poland
Piotr Stera

About this paper

Cite this paper

Dustor, A. (2016). Influence of Noise and Voice Activity Detection on Speaker Verification. In: Gaj, P., Kwiecień, A., Stera, P. (eds) Computer Networks. CN 2016. Communications in Computer and Information Science, vol 608. Springer, Cham. https://doi.org/10.1007/978-3-319-39207-3_18

DOI: https://doi.org/10.1007/978-3-319-39207-3_18
Published: 01 June 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-39206-6
Online ISBN: 978-3-319-39207-3
eBook Packages: Computer Science, Computer Science (R0)