Skip to main content

Speaker Recognition Based on i-Vector and Improved Local Preserving Projection

  • Conference paper
  • First Online:
Book cover Proceedings of the 2015 Chinese Intelligent Automation Conference

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 336))

  • 1126 Accesses

Abstract

In order to enhance the recognition performance of the i-vector speaker recognition system under unpredicted noise environment, an improved local preserve projection which was used for reduce dimension to i-vector is proposed on this paper. First, the nonzero eigenvalue is rejected when we solve the optimized objective function, only using the eigenvalue the value of which is greater than zero. A mapping matrix is obtained by solving a generalized eigenvalue problem which can settle the singular value problem which occurred in the traditional local preserve projection algorithm. The experiment result shows that the recognition performance of the method proposed in this paper is improved under several kinds of noise environments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kinnunen T, Li HZ (2010) An overview of text-independent speaker recognition: from features to supervectors. Speech Commun 52:12–40

    Article  Google Scholar 

  2. Tohidypour HR, Seyyedsalehi SA, Behbood H, Roshandel H (2012) A new representation for speech frame recognition based on redundant wavelet filter banks. Speech Commun 54:256–271

    Google Scholar 

  3. Perrachione TK, Del Tufo SN, Gabrieli JDE (2011) Human voice recognition depends on language ability. Science 333:595

    Google Scholar 

  4. Eskikanda PZ, Seyyedsalehia SA (2012) Robust speech recognition by extracting invariant features. Procedia Soc Behav Sci 32(3):230–237

    Google Scholar 

  5. Yang S, Zhaozhuang J, Deliang W (2009) An auditory based feature for robust speech recognition. ICASSP, Taibei, pp 4625–4628

    Google Scholar 

  6. Wu D, Cao J, Wang J, Li W (2012) Multi-feature fusion face recognition based on Kernel discriminate local preserve projection algorithm under smart environment. J Comput 7(10):2479–2487

    Google Scholar 

  7. Du J, Huo Q (2011) A feature compensation approach using high-order vector taylor series approximation of an explicit distortion model for noisy speech recognition. IEEE Trans Audio Speech Lang Process 19(8):2285–2293

    Google Scholar 

  8. Jeong Y (2010) Speaker adaptation based on the multilinear decomposition of training speaker models. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing, Dallas, USA pp 4870–4873

    Google Scholar 

  9. He Y, Han J (2011) Gaussian specific compensation for channel distortion in speech recognition. IEEE Signal Process Lett 18(10):599–602

    Article  Google Scholar 

  10. Dehzangi O, Mab B, Chng ES, Li H (2012) Discriminative feature extraction for speech recognition using continuous output codes. Pattern Recogn Lett 33:1703–1709

    Google Scholar 

  11. Gu XH, Gong WG, Yang LP (2011) Supervised graph-optimized locality preserving projections. Opt Precis Eng 19(3):672–680

    Google Scholar 

  12. Dehak N, Kenny P, Dehak R, Dumouchel P, Ouel-let P (2010) Front-end factor analysis for speaker verification. IEEE Trans Audio Speech Lang Process 19(99)

    Google Scholar 

Download references

Acknowledgments

This work was supported in part by the National Science-technology Support Plan Project of China under contract 1214ZGA008, the Nature Science Foundation of China under contract 61263031, and the Science Foundation of Gansu Province of China under contract 1010RJZA046.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Di Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wu, D. (2015). Speaker Recognition Based on i-Vector and Improved Local Preserving Projection. In: Deng, Z., Li, H. (eds) Proceedings of the 2015 Chinese Intelligent Automation Conference. Lecture Notes in Electrical Engineering, vol 336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46469-4_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-46469-4_12

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-46468-7

  • Online ISBN: 978-3-662-46469-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics