Speaker Dependent Frequency Cepstrum Coefficients

Orság, Filip

doi:10.1007/978-3-642-10847-1_32

Speaker Dependent Frequency Cepstrum Coefficients

Filip Orság⁵

Conference paper

788 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 58))

Abstract

This paper aims at speaker recognition based upon a novel set of features. Feature extraction is a crucial phase of the speaker recognition process and a proper feature set can influence it dramatically. Many well-known features are not suitable for the speaker recognition as those merge the specifics of the individual voices to make them universal. Therefore, we need features accentuating the individual differences of our voices to be able to recognise speakers reliably. This paper introduces Speaker Dependent Frequency Cepstrum Coefficients (SDFCC) intended for the speaker recognition purposes only. Experimental results prove increase of the reliability in comparison to the well-known features. According to the test results, the SDFCC are very useful and promising for the speaker recognition.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Rodman, D.R.: Computer Speech Technology. Artech House, Boston (1999)
Google Scholar
Sigmund, M.: Speaker Normalization by Long-Time Spectrum. In: Proceedings of Radioelektronika 1996, Brno, CZ, pp. 144–147 (1996)
Google Scholar
Oppenheim, A.V., Schafer, R.W., Buck, J.R.: Discrete-Time Signal Processing, 2nd edn. Prentice Hall, Upper Saddle River (1999)
Google Scholar
Sigmund, M.: Estimation of Vocal Tract Long-Time Spectrum. In: Proceedings of Elektronische Sprachsignalverarbeitung, Dresden, vol. 9, pp. 190–192 (1998)
Google Scholar
Sigmund, M.: Speaker Recognition – Identifying People by their Voices. Conferment thesis FEE BUT, Brno (2000) ISBN 80-214-1590-8
Google Scholar
Markel, J.D., Gray, A.H.: Linear Prediction of Speech. Springer, New York (1976)
MATH Google Scholar
Xafopoulos, A.: Speaker Verification. Tampere International Center for Signal Processing, TUT, Tampere, Finland (2001)
Google Scholar
Baggenstoss, P.M.: Hidden Markov Models Toolbox. Naval Undersea Warfare Centre, Newport, RI (2001)
Google Scholar
Woodward, J.D., Orlans, N.M., Higgins, P.T.: Biometrics: Identity Assurance in the Information Age. McGraw-Hill/Osborne, Berkley (2003)
Google Scholar
Orsag, F.: Biometric Security Systems – Speaker Recognition Technology. Dissertation, Brno, CZ (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Information Technology, Bozetechova 2, 612 66, Brno, Czech Republic
Filip Orság

Authors

Filip Orság
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Warsaw & Infobright Inc.,, Poland
Dominik Ślęzak
Hannam University, 306-791, Daejeon, South Korea
Tai-hoon Kim
National Chiao Tung University, Hsinchu, Taiwan
Wai-Chi Fang
Mississippi State University, Mississippi State MS, USA
Kirk P. Arnett

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Orság, F. (2009). Speaker Dependent Frequency Cepstrum Coefficients. In: Ślęzak, D., Kim, Th., Fang, WC., Arnett, K.P. (eds) Security Technology. SecTech 2009. Communications in Computer and Information Science, vol 58. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10847-1_32

Download citation

DOI: https://doi.org/10.1007/978-3-642-10847-1_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10846-4
Online ISBN: 978-3-642-10847-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics