Example-Specific Density Based Matching Kernel for Classification of Varying Length Patterns of Speech Using Support Vector Machines

Sachdev, Abhijeet; Dileep, A. D.; Thenkanidiyoor, Veena

doi:10.1007/978-3-319-26532-2_20

Abhijeet Sachdev¹⁷,
A. D. Dileep¹⁷ &
Veena Thenkanidiyoor¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9489))

Included in the following conference series:

International Conference on Neural Information Processing

2078 Accesses
2 Citations

Abstract

In this paper, we propose example-specific density based matching kernel (ESDMK) for the classification of varying length patterns of long duration speech represented as sets of feature vectors. The proposed kernel is computed between the pair of examples, represented as sets of feature vectors, by matching the estimates of the example-specific densities computed at every feature vector in those two examples. In this work, the number of feature vectors of an example among the K nearest neighbors of a feature vector is considered as an estimate of the example-specific density. The minimum of the estimates of two example-specific densities, one for each example, at a feature vector is considered as the matching score. The ESDMK is then computed as the sum of the matching score computed at every feature vector in a pair of examples. We study the performance of the support vector machine (SVM) based classifiers using the proposed ESDMK for speech emotion recognition and speaker identification tasks and compare the same with that of the SVM-based classifiers using the state-of-the-art kernels for varying length patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Rabiner, L., Juang, B.-H.: Fundamentals of Speech Recognition. Pearson Education, New Jersey (2003)
MATH Google Scholar
Reynolds, D.A.: Speaker identification and verification using Gaussian mixture speaker models. Speech Commun. 17, 91–108 (1995)
Article Google Scholar
Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digit. Signal Proc. 10(1–3), 19–41 (2000)
Article Google Scholar
Dileep, A.D., Chandra Sekhar, C.: GMM-based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines. IEEE Trans. Neural Netw. Learn. Syst. 25(8), 1421–1432 (2014)
Article Google Scholar
Smith, N., Gales, M., Niranjan, M.: Data-dependent kernels in SVM classification of speech patterns. Technical Report CUED/F-INFENG/TR.387, Engineering Department, Cambridge University, Cambridge, April 2001
Google Scholar
Lee, K.-A., You, C.H., Li, H., Kinnunen, T.: A GMM-based probabilistic sequence kernel for speaker verification. In: Proceedings of INTERSPEECH, Antwerp, Belgium, pp. 294–297, August 2007
Google Scholar
Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)
Article Google Scholar
You, C.H., Lee, K.A., Li, H.: An SVM kernel with GMM-supervector based on the Bhattacharyya distance for speaker recognition. IEEE Signal Process. Lett. 16(1), 49–52 (2009)
Article Google Scholar
Dileep, A.D., Sekhar Chandra, C.: Speaker recognition using pyramid match kernel based support vector machines. Int. J. Speech Technol. 15(3), 365–379 (2012)
Article Google Scholar
Jaakkola, T., Diekhans, M., Haussler, D.: A discriminative framework for detecting remote protein homologies. J. Comput. Biol. 7(1–2), 95–114 (2000)
Article Google Scholar
Burkhardt, F., Paeschke, A., Rolfes, M., Weiss, W.S.B.: A database of German emotional speech. In: Proceedings of INTERSPEECH, Lisbon, Portugal, pp. 1517–1520, September 2005
Google Scholar
Steidl, S.: Automatic classification of emotion-related user states in spontaneous childern’s speech. Ph.D. Thesis, Der Technischen Fakultät der Universität Erlangen-Nürnberg, Germany (2009)
Google Scholar
The NIST year 2002 speaker recognition evaluation plan (2002). http://www.itl.nist.gov/iad/mig/tests/spk/2002/
The NIST year 2003 speaker recognition evaluation plan (2003). http://www.itl.nist.gov/iad/mig/tests/sre/2003/
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(3), 27:1–27:27 (2011). http://www.csie.ntu.edu.tw/cjlin/libsvm
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing and Electrical Engineering, Indian Institute of Technology Mandi, Mandi, 175001, Himachal Pradesh, India
Abhijeet Sachdev & A. D. Dileep
Department of Computer Science and Engineering, National Institute of Technology Goa, Ponda, 401403, Goa, India
Veena Thenkanidiyoor

Authors

Abhijeet Sachdev
View author publications
You can also search for this author in PubMed Google Scholar
A. D. Dileep
View author publications
You can also search for this author in PubMed Google Scholar
Veena Thenkanidiyoor
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A. D. Dileep .

Editor information

Editors and Affiliations

University of Istanbul, Istanbul, Turkey
Sabri Arik
University at Qatar, Doha, Qatar
Tingwen Huang
Tunku Abdul Rahman University College, Kuala Lumpur, Malaysia
Weng Kin Lai
University of Science Technology, Wuhan, China
Qingshan Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sachdev, A., Dileep, A.D., Thenkanidiyoor, V. (2015). Example-Specific Density Based Matching Kernel for Classification of Varying Length Patterns of Speech Using Support Vector Machines. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science(), vol 9489. Springer, Cham. https://doi.org/10.1007/978-3-319-26532-2_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-26532-2_20
Published: 12 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26531-5
Online ISBN: 978-3-319-26532-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics