Skip to main content

Multilingual Speaker Identification with the Constraint of Limited Data Using Multitaper MFCC

  • Conference paper
Recent Trends in Computer Networks and Distributed Systems Security (SNDS 2012)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 335))

  • 1280 Accesses

Abstract

Feature extraction has the ability to improve the performance of speaker identification systems. This paper studies the significance of low-variance multitaper Mel-frequency cepstral coefficient (multitaper MFCC) features for Multilingual speaker identification with the constraint of limited data. The speaker identification study is conducted using 30 speakers of our own database. Sine-weighted cepstrum estimator (SWCE) taper MFCC features are extracted and modeled using Gaussian Mixture Model (GMM)-Universal Background Model (UBM). The results show that the multitaper MFCC approach performs better than the conventional Hamming window MFCC technique in all the speaker identification experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Salman, A., Muhammad, E., Khurshid, K.: Speaker Verification using Boosted Cepstral Features with Gaussian Distributions. In: Proc. IEEE, INMIC 2007, pp. 1–5 (2007)

    Google Scholar 

  2. Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Trans. Speech and Audio Processing 3, 72–83 (1995)

    Article  Google Scholar 

  3. Jayanna, H.S., Mahadeva Prasanna, S.R.: Analysis, Feature Extraction, Modeling and Testing techniques for Speaker Recognition. IETE Technical Review 26, 181–190 (2009)

    Article  Google Scholar 

  4. Arjun, P.H.: Speaker Recognition in Indian Languages: A Feature Based Approach. Ph.D. dissertation, Indian Institute of Technology Kharagpur, INDIA (July 2005)

    Google Scholar 

  5. Jayanna, H.S.: Limited data Speaker Recognition. Ph.D. dissertation, Indian Institute of Technology, Guwahati, INDIA (November 2009)

    Google Scholar 

  6. Kinnunen, T., Saeidi, R., Sandberg, J., Hansson-Sandsten, M.: What Else is New Than the HammingWindow? Robust MFCCs for Speaker Recognition via Multitapering. In: Proc. Interspeech 2010, pp. 2734–2737 (September 2010)

    Google Scholar 

  7. Durou, G.: Multilingual text-independent speaker identification. In: Proc. MIST 1999 Workshop, Leusden, Netherlands, pp. 115–118 (1999)

    Google Scholar 

  8. Pandey, B., Ranjan, A., Kumar, R., Shukla, A.: Multilingual Speaker Recognition Using ANFIS. In: Proc. IEEE, ICSPS, vol. 3, pp. 714–718 (2010)

    Google Scholar 

  9. Nagaraja, B.G., Jayanna, H.S.: Multi-lingual Speaker Identification with the constraint of Limited data. Accepted for publication in Proc. ICAdC 2012, MSRIT, Bengaluru. Springer (July 2012)

    Google Scholar 

  10. Sandberg, J., Hansson-Sandsten, M., Kinnunen, T., Saeidi, R., Flandrin, P., Borgnat, P.: Multitaper Estimation of Frequency-Warped Cepstra With Application to Speaker Verification. IEEE Signal Processing Letters 17, 343–346 (2010)

    Article  Google Scholar 

  11. Alam, M.J., Kinnunen, T., Kenny, P., Ouellet, P., O’Shaughnessy, D.: Multi-taper MFCC Features for Speaker Verification using I-vectors. In: Proc. IEEE, ASRU 2011, pp. 547–552 (December 2011)

    Google Scholar 

  12. Kinnunen, T., Saeidi, R., Sedlák, F., Lee, K.A., Sandberg, J., Hansson-Sandsten, M., Li, H.: Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification. IEEE Transaction on Audio, Speech and Language Processing 20, 1990–2001 (2012)

    Article  Google Scholar 

  13. Percival, D.B., Walden, A.T.: Spectral Analysis for Physical Applications. Cambridge Univ. Press, Cambridge (1993)

    Book  MATH  Google Scholar 

  14. Thomson, D.J.: Spectrum estimation and harmonic analysis. Proc. IEEE 70, 1055–1096 (1982)

    Article  Google Scholar 

  15. Riedel, K.S., Sidorenko, A.: Minimum bias multiple taper spectral estimation. IEEE Trans. Signal Process. 43, 188–195 (1995)

    Article  Google Scholar 

  16. Ku, J.M.K., Ambikairajan, E., Epps, J., Togneri, R.: Speaker Verification Using Sparse Representation Classification. In: Proc. IEEE, ICASSP, pp. 4548–4551 (2011)

    Google Scholar 

  17. Hosseinzadeh, D., Krishnan, S.: Combining Vocal Source and MFCC Features for Enhanced Speaker Recognition Performance Using GMMs. In: Proc. IEEE, MMSP 2007, pp. 365–368 (October 2007)

    Google Scholar 

  18. Reynolds, D.: Universal Background Models. Encyclopedia of Biometric Recognition, Journal Article (February 2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nagaraja, B.G., Jayanna, H.S. (2012). Multilingual Speaker Identification with the Constraint of Limited Data Using Multitaper MFCC. In: Thampi, S.M., Zomaya, A.Y., Strufe, T., Alcaraz Calero, J.M., Thomas, T. (eds) Recent Trends in Computer Networks and Distributed Systems Security. SNDS 2012. Communications in Computer and Information Science, vol 335. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34135-9_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-34135-9_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34134-2

  • Online ISBN: 978-3-642-34135-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics