Abstract
A novel approach to speaker recognition in mobile or IP network environments is described. In this approach, we use line spectral frequency (LSF) parameters decoded directly from compressed speech packets, instead of parameters obtained through a decompression and analysis procedure. Furthermore, we shorten the LSF series using a restricted temporal decomposition method. According to our experiments, the proposed approach is more than three times faster than a traditional speaker recognition approach, with no loss of accuracy.
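To make the idea of shortening an LSF series concrete, the sketch below thins a sequence of LSF vectors by keeping a frame only when it differs sufficiently from the last kept frame. This is a simple distance-based stand-in chosen for illustration; it is not the restricted temporal decomposition method used in the paper, whose details are not given in the abstract. The function name `reduce_lsf_frames`, the `threshold` parameter, and the toy three-dimensional vectors are all assumptions.

```python
import math


def reduce_lsf_frames(frames, threshold):
    """Keep a frame only when it is far enough from the last kept frame.

    Illustrative thinning only: this is NOT restricted temporal
    decomposition, just a way to show an LSF sequence being shortened.
    """
    if not frames:
        return []
    kept = [frames[0]]  # always keep the first frame as a reference
    for frame in frames[1:]:
        # Euclidean distance between this frame and the last kept frame
        if math.dist(frame, kept[-1]) > threshold:
            kept.append(frame)
    return kept


# Hypothetical toy LSF vectors (values are made up for the example).
frames = [
    [0.10, 0.50, 0.90],
    [0.11, 0.50, 0.90],
    [0.10, 0.51, 0.90],
    [0.30, 0.70, 1.10],
    [0.30, 0.71, 1.10],
]
reduced = reduce_lsf_frames(frames, threshold=0.05)
```

With these toy values, the near-duplicate frames are dropped and only the first frame and the frame after the large spectral change remain, so any downstream speaker model would score fewer vectors.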
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Kim, SJ., Kim, MS., Yu, HJ. (2007). Speaker Recognition Using Temporal Decomposition of LSF for Mobile Environment. In: Lee, YH., Kim, HN., Kim, J., Park, Y., Yang, L.T., Kim, S.W. (eds) Embedded Software and Systems. ICESS 2007. Lecture Notes in Computer Science, vol 4523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72685-2_32
DOI: https://doi.org/10.1007/978-3-540-72685-2_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72684-5
Online ISBN: 978-3-540-72685-2
eBook Packages: Computer Science (R0)