Teager Energy Operator Based Features with x-vector for Replay Attack Detection

Zhang, Zhenchuan; Zhou, Liming; Yang, Yingchun; Wu, Zhaohui

doi:10.1007/978-3-030-31456-9_51

Zhenchuan Zhang¹³,
Liming Zhou¹³,
Yingchun Yang¹³ &
…
Zhaohui Wu¹³

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11818))

Included in the following conference series:

Chinese Conference on Biometric Recognition

1663 Accesses

Abstract

Audio replay attack poses great threat to Automatic Speaker Verification (ASV) systems. In this paper, we propose a set of features based on Teager Energy Operator and a slightly modified version of x-vector system to detect replay attacks. The proposed methods are tested on ASVspoof 2017 corpus. When using GMM with the proposed features, our best system has an EER of 6.13% on dev set and 15.53% on eval set, while the EER for the baseline system (GMM with CQCC) is 30.60% on eval set. When combined with the modified x-vector, the best EER further drops to 5.57% for dev subset and 14.21% for eval subset.

This work is supported by NSFC 61602404 and the National Basic Research Program of China (973 Program) (No. 2013CB329504).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Replay spoofing countermeasures using high spectro-temporal resolution features

Article 20 February 2019

On the performance of empirical mode decomposition-based replay spoofing detection in speaker verification systems

Article 29 August 2020

A Replay Voice Detection Algorithm Based on Multi-feature Fusion

References

Ergünay, S.K., Khoury, E., Lazaridis, A., Marcel, S.: On the vulnerability of speaker verification to realistic voice spoofing. In: 2015 IEEE 7th International Conference on Biometrics Theory, Applications and Systems (BTAS), pp. 1–6. IEEE (2015)
Google Scholar
Alegre, F., Janicki, A., Evans, N.: Re-assessing the threat of replay spoofing attacks against automatic speaker verification. In: 2014 International Conference of the Biometrics Special Interest Group (BIOSIG), pp. 1–6. IEEE (2014)
Google Scholar
Villalba, J., Lleida, E.: Detecting replay attacks from far-field recordings on speaker verification systems. In: Vielhauer, C., Dittmann, J., Drygajlo, A., Juul, N.C., Fairhurst, M.C. (eds.) BioID 2011. LNCS, vol. 6583, pp. 274–285. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-19530-3_25
Chapter Google Scholar
Kinnunen, T., et al.: ASVspoof 2017: automatic speaker verification spoofing and countermeasures challenge evaluation plan. Training 10(1508), 1508 (2017)
Google Scholar
Lavrentyeva, G., Novoselov, S., Malykh, E., Kozlov, A., Kudashev, O., Shchemelinin, V.: Audio replay attack detection with deep learning frameworks. In: INTERSPEECH, pp. 82–86 (2017)
Google Scholar
Witkowski, M., Kacprzak, S., Zelasko, P., Kowalczyk, K., Galka, J.: Audio replay attack detection using high-frequency features. In: INTERSPEECH, pp. 27–31 (2017)
Google Scholar
Kaiser, J.F.: On a simple algorithm to calculate the ‘energy’ of a signal. In: International Conference on Acoustics, Speech, and Signal Processing, pp. 381–384. IEEE (1990)
Google Scholar
Patil, H.A., Kamble, M.R., Patel, T.B., Soni, M.H.: Novel variable length Teager energy separation based instantaneous frequency features for replay detection. In: INTERSPEECH, pp. 12–16 (2017)
Google Scholar
Nagarsheth, P., Khoury, E., Patil, K., Garland, M.: Replay attack detection using DNN for channel discrimination. In: INTERSPEECH, pp. 97–101 (2017)
Google Scholar
Snyder, D., Garcia-Romero, D., Povey, D., Khudanpur, S.: Deep neural network embeddings for text-independent speaker verification. In: INTERSPEECH, pp. 999–1003 (2017)
Google Scholar
Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M.: Learning spatiotemporal features with 3D convolutional networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4489–4497 (2015)
Google Scholar
Zhou, L.: Research on audio replay detection method for speaker recognition. Master’s thesis, Zhejiang University (2019)
Google Scholar
Maragos, P., Kaiser, J.F., Quatieri, T.F.: Energy separation in signal modulations with application to speech analysis. IEEE Trans. Signal Process. 41(10), 3024–3051 (1993)
Article Google Scholar
Lee, K.A., et al.: The RedDots data collection for speaker recognition. In: Sixteenth Annual Conference of the International Speech Communication Association (2015)
Google Scholar
Chettri, B., Mishra, S., Sturm, B.L., Benetos, E.: A study on convolutional neural network based end-to-end replay anti-spoofing. arXiv preprint arXiv:1805.09164 (2018)
Zhu, Y., Ko, T., Snyder, D., Mak, B., Povey, D.: Self-attentive speaker embeddings for text-independent speaker verification. In: Proceedings of the INTERSPEECH, vol. 2018, pp. 3573–3577 (2018)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Computer Science and Technology, Zhejiang University, Hangzhou, China
Zhenchuan Zhang, Liming Zhou, Yingchun Yang & Zhaohui Wu

Authors

Zhenchuan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Liming Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Yingchun Yang
View author publications
You can also search for this author in PubMed Google Scholar
Zhaohui Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yingchun Yang .

Editor information

Editors and Affiliations

Chinese Academy of Sciences, Beijing, China
Zhenan Sun
Chinese Academy of Sciences, Beijing, China
Ran He
Tsinghua University, Beijing, China
Jianjiang Feng
Chinese Academy of Sciences, Beijing, China
Shiguang Shan
Tsinghua University, Shenzhen, China
Zhenhua Guo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, Z., Zhou, L., Yang, Y., Wu, Z. (2019). Teager Energy Operator Based Features with x-vector for Replay Attack Detection. In: Sun, Z., He, R., Feng, J., Shan, S., Guo, Z. (eds) Biometric Recognition. CCBR 2019. Lecture Notes in Computer Science(), vol 11818. Springer, Cham. https://doi.org/10.1007/978-3-030-31456-9_51

Download citation

DOI: https://doi.org/10.1007/978-3-030-31456-9_51
Published: 05 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-31455-2
Online ISBN: 978-3-030-31456-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Teager Energy Operator Based Features with x-vector for Replay Attack Detection

Abstract

Access this chapter

Similar content being viewed by others

Replay spoofing countermeasures using high spectro-temporal resolution features

On the performance of empirical mode decomposition-based replay spoofing detection in speaker verification systems

A Replay Voice Detection Algorithm Based on Multi-feature Fusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Teager Energy Operator Based Features with x-vector for Replay Attack Detection

Abstract

Access this chapter

Similar content being viewed by others

Replay spoofing countermeasures using high spectro-temporal resolution features

On the performance of empirical mode decomposition-based replay spoofing detection in speaker verification systems

A Replay Voice Detection Algorithm Based on Multi-feature Fusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation