Speaker Verification Using Coded Speech

Moreno-Daniel, Antonio; Juang, Biing-Hwang; Nolazco-Flores, Juan A.

doi:10.1007/978-3-540-30463-0_45

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3287)

Included in the following conference series: Iberoamerican Congress on Pattern Recognition

Abstract

This paper describes the implementation of a pseudo text-independent speaker verification system designed to use only information extracted directly from the coded parameters embedded in the ITU-T G.729 bit-stream. Experiments were performed on the YOHO database [1]. The feature vector, a short-time representation of speech, consists of 16 LPC-cepstral coefficients with residual information appended in the form of a pitch estimate and a measure of vocality of the speech. The robustness of verification accuracy is also studied. The results show that although speech coders, and G.729 in particular, introduce coding distortions that degrade verification performance, the augmented use of this unconventional information nevertheless yields performance on par with that of a well-studied traditional system that involves no signal coding or transmission. This suggests that speaker verification over a cell phone connection remains feasible even when the signal has been encoded at 8 kbit/s.
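
As an illustration of the kind of feature vector the abstract describes, the sketch below derives LPC-cepstral coefficients from linear-prediction coefficients with the standard LPC-to-cepstrum recursion and appends a pitch estimate and a voicing measure. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the sign convention H(z) = 1 / (1 - sum_k a_k z^-k), and the example values are hypothetical, and the decoding of the G.729 bit-stream (which carries 10th-order LP information as quantized line spectral pairs) into predictor coefficients is assumed to have been done elsewhere.

```python
def lpc_to_cepstrum(a, n_ceps=16):
    """Convert predictor coefficients a_1..a_p into cepstral coefficients c_1..c_n.

    Assumes the all-pole model H(z) = 1 / (1 - sum_k a_k z^-k).
    If the coefficients follow the A(z) = 1 + sum_k a_k z^-k convention,
    negate them before calling this function.
    """
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # c[0] (gain term) is left unused
    for n in range(1, n_ceps + 1):
        # Direct term exists only while n does not exceed the LP order.
        acc = a[n - 1] if n <= p else 0.0
        # Recursive term: sum over k of (k/n) * c_k * a_{n-k}, with 1 <= n-k <= p.
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]


def build_feature_vector(a, pitch, voicing, n_ceps=16):
    """Append a pitch estimate and a voicing measure to the cepstral coefficients."""
    return lpc_to_cepstrum(a, n_ceps) + [pitch, voicing]


if __name__ == "__main__":
    # Hypothetical 10th-order predictor for one frame (illustrative values only).
    frame_lpc = [0.9, -0.4, 0.1, 0.0, 0.05, 0.0, 0.0, 0.0, 0.0, 0.0]
    features = build_feature_vector(frame_lpc, pitch=120.0, voicing=0.8)
    print(len(features))  # 16 cepstral coefficients + pitch + voicing
```

For a single-pole model with coefficient a, the recursion reduces to c_n = a^n / n, which gives a quick sanity check. How the pitch estimate and the vocality measure are actually obtained from the coded parameters is not stated in the abstract, so they are passed in here as plain numbers.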

References

1. Campbell Jr., J.P.: Testing with the YOHO CD-ROM voice verification corpus. In: Proc. ICASSP (1995)
2. Furui, S.: Recent Advances in Speaker Recognition. In: First Int. Conf. on Audio- and Video-based Biometric Person Authentication, Switzerland, pp. 237–252 (1997)
3. Reynolds, D.A.: An Overview of Automatic Speaker Recognition Technology. In: Proc. ICASSP (2002)
4. Reynolds, D.A., Rose, R.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions on Speech and Audio Processing (1995)
5. Rosenberg, A.E., Siohan, O., Parthasarathy, S.: Speaker verification using minimum verification error training. In: Proc. ICASSP (1998)
6. Li, Q., Juang, B.-H., Zhou, Q., Lee, C.-H.: Automatic Verbal Information Verification for User Authentication. IEEE Transactions on Speech and Audio Processing, 585–596 (2000)
7. Kim, H.K., Cox, R.: Bitstream-based feature extraction for wireless speech recognition. In: Proc. ICASSP (2000)
8. Zhong, X.: Speech coding and transmission for improved recognition in a communication network. PhD Dissertation, Georgia Institute of Technology (2000)
9. ITU-T Recommendation G.729: Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear-prediction (CS-ACELP) (1996)
10. Young, S., et al.: The HTK Book, Version 3.2. Cambridge University (2002)
11. Huang, X., Acero, A., Hon, H.W.: Spoken Language Processing. Prentice Hall, Englewood Cliffs (2001)
12. Quatieri, T.F., et al.: Speaker Recognition Using G.729 Speech Codec Parameters. In: Proc. ICASSP (2000)
13. Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs (1993)
14. ITU-T Recommendation G.191: Software tool library 2000 user’s manual (2000)
15. Yu, E.W.M., Mak, M.-W., Sit, C.-H., Kung, S.-Y.: Speaker verification based on G.729 and G.723.1 coder parameters and handset mismatch compensation. In: Proc. of the 8th European Conference on Speech Communication and Technology (2003)
16. Besacier, L., Grassi, S., Dufaux, A., Ansorge, M., Pellandini, F.: GSM Speech Coding and Speaker Recognition. In: Proc. ICASSP (2000)

Author information

Authors and Affiliations

Center for Signal and Image Processing, Georgia Institute of Technology, Atlanta, GA, USA
Antonio Moreno-Daniel & Biing-Hwang Juang
Departamento de Ciencias Computacionales, Instituto Tecnológico y de Estudios Superiores de Monterrey, Monterrey, NL, México
Antonio Moreno-Daniel & Juan A. Nolazco-Flores

Editor information

Editors and Affiliations

Dept. System Engineering and Automation, Universitat Politècnica de Catalunya (UPC), Barcelona, Spain
Alberto Sanfeliu
Computer Science Department, National Institute of Astrophysics, Optics and Electronics (INAOE), Luis Enrique Erro No. 1, 72840, Sta. Maria Tonantzintla, Puebla, Mexico
José Francisco Martínez Trinidad
Computer Science Department, National Institute of Astrophysics, Optics and Electronics (INAOE), Luis Enrique Erro No. 1, 72840, Sta. Maria Tonantzintla, Puebla, Mexico
Jesús Ariel Carrasco Ochoa

Cite this paper

Moreno-Daniel, A., Juang, B.-H., Nolazco-Flores, J.A. (2004). Speaker Verification Using Coded Speech. In: Sanfeliu, A., Martínez Trinidad, J.F., Carrasco Ochoa, J.A. (eds) Progress in Pattern Recognition, Image Analysis and Applications. CIARP 2004. Lecture Notes in Computer Science, vol 3287. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30463-0_45

DOI: https://doi.org/10.1007/978-3-540-30463-0_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23527-9
Online ISBN: 978-3-540-30463-0
eBook Packages: Springer Book Archive
