Speaker Identification from Mixture of Speech and Non-speech Audio Signal

Yasmin, Ghazaala; Dhara, Subrata; Mahindar, Rudrendu; Das, Asit Kumar

doi:10.1007/978-981-13-0514-6_47

Ghazaala Yasmin¹⁹,
Subrata Dhara²⁰,
Rudrendu Mahindar²⁰ &
…
Asit Kumar Das²¹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 758))

853 Accesses
1 Citations

Abstract

Separating speaker from an amalgam of multiple sounds is a challenging area in the domain of speech processing. Henceforth, it has been quickly led to the new area of development in the subfield of speech processing called speaker identification. The proposed work presents a new approach to catch this problem by using acoustic features of the audio signal. The mixture of speech and non-speech audio signal has got separated by using filtering algorithm followed by the recognition of the speech audio by extracting noteworthy acoustic features. A new feature has got implemented as part of contribution to the proposed work named del-MFCC. The computed features have been served for identification of speakers using different popular classifiers. The performance of the presented methodology has been compared with the existing related methods to express the usefulness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Reynolds, D.A.: Automatic speaker recognition using Gaussian mixture speaker models (1995)
Google Scholar
Dudeja, K., Kharbanda, A.: Applications of digital signal processing to speech recognition. Int. J. Res. 2(5), 191–194 (2015)
Google Scholar
Xu, H.H.: Text Dependent Speaker Recognition Study (2015)
Google Scholar
Revathi, A., Ganapathy, R., Venkataramani, Y.: Text independent speaker recognition and speaker independent speech recognition using iterative clustering approach. Int. J. Comput. Sci. Inf. Technol. (IJCSIT) 1(2), 30–42 (2009)
Google Scholar
Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digit. Signal Process. 10(1–3), 19–41 (2000)
Article Google Scholar
Kua, J.M.K., Thiruvaran, T., Nosratighods, M., Ambikairajah, E., Epps, J.: Investigation of spectral centroid magnitude and frequency for speaker recognition. In: Odyssey, p. 7 (2010)
Google Scholar
Doddington, G.R.: Speaker recognition based on idiolectal differences between speakers. In: Interspeech, pp. 2521–2524 (2001)
Google Scholar
Paul, D., Parekh, R.: Automated speech recognition of isolated words using neural networks. Int. J. Eng. Sci. Technol. (IJEST) 3(6), 4993–5000 (2011)
Google Scholar
Otero, P.L.: Improved strategies for speaker segmentation and emotional state detection (2015)
Google Scholar
Campbell, J.P.: Speaker recognition: a tutorial. Proc. IEEE 85(9), 1437–1462 (1997)
Article Google Scholar
Atame, S., Shanthi Therese, S., Gedam, M.: A Survey on: Continuous Voice Recognition Techniques
Google Scholar
Mermelstein, P.: Distance measures for speech recognition, psychological and instrumental. Pattern Recog. Artif. Intell. 116, 374–388 (1976)
Google Scholar
Lartillot, O., Toiviainen, P., Eerola, T.: A matlab toolbox for music information retrieval. In: Data Analysis, Machine Learning and Applications, pp. 261–268 (2008)
Google Scholar
Beigi, H.: Audio source classification using speaker recognition techniques. World Wide Web (2011)
Google Scholar
Mazaira-Fernandez, L.M., Álvarez-Marquina, A., Gómez-Vilda, P.: Improving speaker recognition by biometric voice deconstruction. Front. Bioeng. Biotechnol. 3 (2015)
Google Scholar
Srivastava, S.: Weka: a tool for data preprocessing, classification, ensemble, clustering and association rule mining. Int. J. Comput. Appl. 88(10) (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

CSE Department, St. Thomas‘ College of Engineering & Technology, Kolkata, 700023, India
Ghazaala Yasmin
ECE Department, St. Thomas‘ College of Engineering & Technology, Kolkata, 700023, India
Subrata Dhara & Rudrendu Mahindar
CST Department, Indian Institute of Engineering Science and Technology, Shibpur, Howrah, 711103, India
Asit Kumar Das

Authors

Ghazaala Yasmin
View author publications
You can also search for this author in PubMed Google Scholar
Subrata Dhara
View author publications
You can also search for this author in PubMed Google Scholar
Rudrendu Mahindar
View author publications
You can also search for this author in PubMed Google Scholar
Asit Kumar Das
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ghazaala Yasmin .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Sri Sivani College of Engineering, Srikakulam, Andhra Pradesh, India
Janmenjoy Nayak
Machine Intelligence Research Labs (MIR Labs), Scientific Network for Innovation and Research Excellence, Washington, USA
Ajith Abraham
Department of Mechanical Engineering, Sri Sivani College of Engineering, Srikakulam, Andhra Pradesh, India
B. Murali Krishna
Department of Electrical and Electronics Engineering, Sri Sivani College of Engineering, Srikakulam, Andhra Pradesh, India
G. T. Chandra Sekhar
Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology (IIEST), Shibpur, Howrah, West Bengal, India
Asit Kumar Das

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yasmin, G., Dhara, S., Mahindar, R., Das, A.K. (2019). Speaker Identification from Mixture of Speech and Non-speech Audio Signal. In: Nayak, J., Abraham, A., Krishna, B., Chandra Sekhar, G., Das, A. (eds) Soft Computing in Data Analytics . Advances in Intelligent Systems and Computing, vol 758. Springer, Singapore. https://doi.org/10.1007/978-981-13-0514-6_47

Download citation

DOI: https://doi.org/10.1007/978-981-13-0514-6_47
Published: 22 August 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0513-9
Online ISBN: 978-981-13-0514-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics