Ubiquitous and Robust Text-Independent Speaker Recognition for Home Automation Digital Life

Wang, Jhing-Fa; Kuan, Ta-Wen; Wang, Jia-chang; Gu, Gaung-Hui

doi:10.1007/978-3-540-69293-5_24

Jhing-Fa Wang¹,
Ta-Wen Kuan¹,
Jia-chang Wang¹ &
…
Gaung-Hui Gu¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5061))

Included in the following conference series:

International Conference on Ubiquitous Intelligence and Computing

1166 Accesses
5 Citations

Abstract

This paper presents a ubiquitous and robust text-independent speaker recognitionarchitecture for home automation digital life. In this architecture, a multiple microphone configuration is adopted to receive the pervasive speech signals. The multi-channel speech signals are then added together with a mixer. In a ubiquitous computing environment, the received speech signal is usually heavily corrupted by background noises. An SNR-aware subspace speech enhancement approach is used as a pre-processing to enhance the mixed signal. Considering the text-independent speaker recognition, this paper applies a multi-class support vectors machine (SVM)[10][11] instead of conventional Gaussian mixture models (GMMs)[12]. In our experiments, the speaker recognition rate can averagely reach 97.2% with the proposed ubiquitous speaker recognitionarchitecture.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cortes, C., Vapnik, V.: Support vector networks. Machine Learning 20, 273–297 (1995)
MATH Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)
MATH Google Scholar
Vapnik, V.: Statistical Learning Theory. Wiley, New York (1998)
MATH Google Scholar
Schölkopf, B., Mika, S., Burges, C., Knirsch, P., Müller, K.-R., Rätsch, G., Smola, A.: Input space vs. feature space in kernel-based methods. IEEE Transactions on Neural Networks 10(5), 1000–1017 (1999)
Article Google Scholar
Ephraim, Y., Van Trees, H.L.: A signal subspace approach for speech enhancement. IEEE Transactions on Speech and Audio Processing 3(4), 251–266 (1995)
Article Google Scholar
Jia-Ching, W., Hsiao-Ping, L., Jhing-Fa, W., Chung-Hsien, Y.: Critical Band Subspace-Based Speech Enhancement Using SNR and Auditory Masking Aware Technique. IEICE Transactions on Information and Systems E90-D(7), 1055–1062 (2007)
Article Google Scholar
Hui-Ling, H., Fang-Lin, C.: ESVM: Evolutionary support vector machine for automatic feature selection and Classification of micro array data. BioSystems 90, 516–528 (2007)
Article Google Scholar
Shung-Yung, L.: Efficient text independent speaker recognition withwavelet feature selection based multilayered neural network using supervised learning algorithm. Pattern Recognition 40, 3616–3620 (2007)
Article MATH Google Scholar
Shung-Yung, L.: Wavelet feature selection based neural networks with application to the text independent speaker identification. BioSystems 90, 516–528 (2007)
Article Google Scholar
Vincent, W., Steve, R.: Speaker verification using sequence discriminant support vector machines. IEEE transactions on speech and audio processing 13(2) (March 2005)
Google Scholar
Campbell, W.M., Campbell, J.P., Gleason, T.P., Reynolds, D.A., Shen, W.: Speaker Verification Using Support Vector Machines and High-Level Features. IEEE transactions on speech, audio and language processing 15(7) (September 2007)
Google Scholar
Burget, L., Matĕjka, P., Schwarz, P., Glembek, O., Cĕrnocký, J.H.: Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System. IEEE transactions on speech, audio and language processing 15(7), 1979–1985 (2007)
Google Scholar
Rabiner, L.R., Schafer, R.W.: Digital Processing of Speech Recognition Signals. Prentice-Hall Co. Ltd, Englewood Cliffs (1978)
Google Scholar
Huang, X., Acero, A., Hon, H.: Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice-Hall Co. Ltd, Englewood Cliffs (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, National Cheng-Kung University, No.1, Dasyue Rd., East District, Tainan City, 701, Taiwan, R.O.C.
Jhing-Fa Wang, Ta-Wen Kuan, Jia-chang Wang & Gaung-Hui Gu

Authors

Jhing-Fa Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ta-Wen Kuan
View author publications
You can also search for this author in PubMed Google Scholar
Jia-chang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Gaung-Hui Gu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Frode Eika Sandnes Yan Zhang Chunming Rong Laurence T. Yang Jianhua Ma

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, JF., Kuan, TW., Wang, Jc., Gu, GH. (2008). Ubiquitous and Robust Text-Independent Speaker Recognition for Home Automation Digital Life. In: Sandnes, F.E., Zhang, Y., Rong, C., Yang, L.T., Ma, J. (eds) Ubiquitous Intelligence and Computing. UIC 2008. Lecture Notes in Computer Science, vol 5061. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69293-5_24

Download citation

DOI: https://doi.org/10.1007/978-3-540-69293-5_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69292-8
Online ISBN: 978-3-540-69293-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics