Speaker Recognition Based on i-Vector and Improved Local Preserving Projection

Wu, Di

doi:10.1007/978-3-662-46469-4_12

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 336)

Abstract

To improve the recognition performance of an i-vector speaker recognition system in unpredicted noise environments, this paper proposes an improved local preserving projection (LPP) for reducing the dimensionality of i-vectors. First, when the optimized objective function is solved, eigenvalues that are not greater than zero are rejected, and only eigenvalues greater than zero are used. A mapping matrix is then obtained by solving a generalized eigenvalue problem, which resolves the singularity problem that occurs in the traditional local preserving projection algorithm. Experimental results show that the proposed method improves recognition performance under several kinds of noise environments.
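
The abstract does not spell out the algorithm, so the following is only a minimal sketch of one common way to solve the projection's generalized eigenvalue problem while discarding non-positive eigenvalues; it should not be read as the chapter's actual method. The function names, the k-nearest-neighbour graph, the median-based kernel width, and the relative tolerance `rel_tol` are all assumptions made for illustration.

```python
import numpy as np


def heat_kernel_graph(X, k=5):
    """Symmetric k-nearest-neighbour affinity matrix with heat-kernel weights.

    X has shape (n_samples, n_features). Graph construction is an assumption.
    """
    n = X.shape[0]
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X @ X.T), 0.0)
    off_diag = ~np.eye(n, dtype=bool)
    t = np.median(d2[off_diag])        # kernel width: a heuristic, not from the source
    np.fill_diagonal(d2, np.inf)       # a point is never its own neighbour
    nbrs = np.argsort(d2, axis=1)[:, :k]
    rows = np.repeat(np.arange(n), k)
    cols = nbrs.ravel()
    W = np.zeros((n, n))
    W[rows, cols] = np.exp(-d2[rows, cols] / t)
    return np.maximum(W, W.T)          # make the graph symmetric


def lpp_positive_eigs(X, n_components, k=5, rel_tol=1e-10):
    """Projection matrix M solving X^T L X a = lambda X^T D X a.

    Only directions with strictly positive eigenvalues of X^T D X are kept,
    so the problem stays solvable when that matrix is singular (for example,
    fewer samples than dimensions).
    """
    W = heat_kernel_graph(X, k)
    D = np.diag(W.sum(axis=1))
    L = D - W                          # graph Laplacian
    A = X.T @ L @ X
    B = X.T @ D @ X
    evals, evecs = np.linalg.eigh(B)
    keep = evals > rel_tol * evals.max()          # reject zero/negative eigenvalues
    P = evecs[:, keep] / np.sqrt(evals[keep])     # whitening: P^T B P = I
    A_r = P.T @ A @ P
    w, V = np.linalg.eigh((A_r + A_r.T) / 2.0)    # eigenvalues in ascending order
    M = P @ V[:, :n_components]                   # smallest eigenvalues preserve locality
    return M, w[:n_components]
```

Under these assumptions, a matrix `X` of i-vectors (one per row) would be reduced with `Y = X @ M`, where `M` is the first value returned by `lpp_positive_eigs`.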

Acknowledgments

This work was supported in part by the National Science-technology Support Plan Project of China under contract 1214ZGA008, the Natural Science Foundation of China under contract 61263031, and the Science Foundation of Gansu Province of China under contract 1010RJZA046.

Author information

Authors and Affiliations

College of Electrical and Information Engineering, Hunan Institute of Engineering, Xiangtan, 411004, China
Di Wu

Corresponding author

Correspondence to Di Wu.

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Zhidong Deng
Department of Computer Science and Techn, Tsinghua University, Beijing, China
Hongbo Li

About this paper

Cite this paper

Wu, D. (2015). Speaker Recognition Based on i-Vector and Improved Local Preserving Projection. In: Deng, Z., Li, H. (eds) Proceedings of the 2015 Chinese Intelligent Automation Conference. Lecture Notes in Electrical Engineering, vol 336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46469-4_12

DOI: https://doi.org/10.1007/978-3-662-46469-4_12
Published: 28 March 2015
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-46468-7
Online ISBN: 978-3-662-46469-4
eBook Packages: Engineering (R0)
