An Analysis of Speaker Recognition Using Bagging CAN2 and Pole Distribution of Speech Signals

Kurogi, Shuichi; Mineishi, Shota; Sato, Seitaro

doi:10.1007/978-3-642-17537-4_45

Shuichi Kurogi¹⁹,
Shota Mineishi¹⁹ &
Seitaro Sato¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6443))

Included in the following conference series:

International Conference on Neural Information Processing

2431 Accesses
10 Citations

Abstract

A method of speaker recognition which uses feature vectors of pole distribution derived from piecewise linear predictive coefficients obtained by bagging CAN2 (competitive associative net 2) is presented and analyzed. The CAN2 is a neural net for learning efficient piecewise linear approximation of nonlinear function, and the bagging CAN2 (bootstrap aggregating version of CAN2) is used to obtain statistically stable multiple linear predictive coefficients. From the coefficients, the present method obtains a number of poles which are supposed to reflect the shape of the speaker’s vocal tract. Then, the pole distribution is used as a feature vector for speaker recognition. The effectiveness is analyzed and validated using real speech data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Kurogi, S., Ueno, T., Sawa, M.: A batch learning method for competitive associative net and its application to function approximation. In: Proc. SCI 2004, pp. 24–28 (2004)
Google Scholar
Kurogi, S., Nedachi, N., Funatsu, Y.: Reproduction and recognition of vowel signals using single and bagging competitive associative nets. In: Ishikawa, M., Doya, K., Miyamoto, H., Yamakawa, T. (eds.) ICONIP 2007, Part II. LNCS, vol. 4985, pp. 40–49. Springer, Heidelberg (2008)
Chapter Google Scholar
Kurogi, S.: Improving generalization performance via out-of-bag estimate using variable size of bags. J. Japanese Neural Network Society 16(2), 81–92 (2009)
Article Google Scholar
Kurogi, S., Sato, S., Ichimaru, K.: Speaker Recognition Using Pole Distribution of Speech Signals Obtained by Bagging CAN2. In: Leung, C.S., Lee, M., Chan, J.H. (eds.) ICONIP 2009. LNCS, vol. 5863, pp. 622–629. Springer, Heidelberg (2009)
Chapter Google Scholar
Ahalt, A.C., Krishnamurthy, A.K., Chen, P., Melton, D.E.: Competitive learning algorithms for vector quantization. Neural Networks 3, 277–290 (1990)
Article Google Scholar
Kohonen, T.: Associative Memory. Springer, Heidelberg (1977)
Book MATH Google Scholar
Campbell, J.P.: Speaker Recognition: A Tutorial. Proc. the IEEE 85(9), 1437–1462 (1997)
Article Google Scholar
Furui, S.: Speaker Recognition. In: Cole, R., Mariani, J., et al. (eds.) Survey of the state of the art in human language technology, pp. 36–42. Cambridge University Press, Cambridge (1998)
Google Scholar
Hasan, M.R., Jamil, M., Rabbani, M.G., Rahman, M.S.: Speaker identification using Mel frequency cepstral coefficients. In: Proc. ICEC 2004, pp. 565–568 (2004)
Google Scholar
Bocklet, T., Shriberg, E.: Speaker recognition using syllable-based constraints for cepstral frame selection. In: Proc. ICASSP (2009)
Google Scholar
Breiman, L.: Bagging predictors. Machine Learning 26, 123–140 (1996)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Kyusyu Institute of Technology, Tobata, Kitakyushu, Fukuoka, 804-8550, Japan
Shuichi Kurogi, Shota Mineishi & Seitaro Sato

Authors

Shuichi Kurogi
View author publications
You can also search for this author in PubMed Google Scholar
Shota Mineishi
View author publications
You can also search for this author in PubMed Google Scholar
Seitaro Sato
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology, Murdoch University, 6150, Murdoch, WA, Australia
Kok Wai Wong
The Australian National University, 0200, Canberra, ACT, Australia
B. Sumudu U. Mendis
School of Electrical, Computer and Telecommunications Engineering, University of Wollongong, Northfields Avenue, 2522, P.O. Box, Wollongong, NSW, Australia
Abdesselam Bouzerdoum

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kurogi, S., Mineishi, S., Sato, S. (2010). An Analysis of Speaker Recognition Using Bagging CAN2 and Pole Distribution of Speech Signals. In: Wong, K.W., Mendis, B.S.U., Bouzerdoum, A. (eds) Neural Information Processing. Theory and Algorithms. ICONIP 2010. Lecture Notes in Computer Science, vol 6443. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17537-4_45

Download citation

DOI: https://doi.org/10.1007/978-3-642-17537-4_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17536-7
Online ISBN: 978-3-642-17537-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics