Abstract
Separating speaker from an amalgam of multiple sounds is a challenging area in the domain of speech processing. Henceforth, it has been quickly led to the new area of development in the subfield of speech processing called speaker identification. The proposed work presents a new approach to catch this problem by using acoustic features of the audio signal. The mixture of speech and non-speech audio signal has got separated by using filtering algorithm followed by the recognition of the speech audio by extracting noteworthy acoustic features. A new feature has got implemented as part of contribution to the proposed work named del-MFCC. The computed features have been served for identification of speakers using different popular classifiers. The performance of the presented methodology has been compared with the existing related methods to express the usefulness of the proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Reynolds, D.A.: Automatic speaker recognition using Gaussian mixture speaker models (1995)
Dudeja, K., Kharbanda, A.: Applications of digital signal processing to speech recognition. Int. J. Res. 2(5), 191–194 (2015)
Xu, H.H.: Text Dependent Speaker Recognition Study (2015)
Revathi, A., Ganapathy, R., Venkataramani, Y.: Text independent speaker recognition and speaker independent speech recognition using iterative clustering approach. Int. J. Comput. Sci. Inf. Technol. (IJCSIT) 1(2), 30–42 (2009)
Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digit. Signal Process. 10(1–3), 19–41 (2000)
Kua, J.M.K., Thiruvaran, T., Nosratighods, M., Ambikairajah, E., Epps, J.: Investigation of spectral centroid magnitude and frequency for speaker recognition. In: Odyssey, p. 7 (2010)
Doddington, G.R.: Speaker recognition based on idiolectal differences between speakers. In: Interspeech, pp. 2521–2524 (2001)
Paul, D., Parekh, R.: Automated speech recognition of isolated words using neural networks. Int. J. Eng. Sci. Technol. (IJEST) 3(6), 4993–5000 (2011)
Otero, P.L.: Improved strategies for speaker segmentation and emotional state detection (2015)
Campbell, J.P.: Speaker recognition: a tutorial. Proc. IEEE 85(9), 1437–1462 (1997)
Atame, S., Shanthi Therese, S., Gedam, M.: A Survey on: Continuous Voice Recognition Techniques
Mermelstein, P.: Distance measures for speech recognition, psychological and instrumental. Pattern Recog. Artif. Intell. 116, 374–388 (1976)
Lartillot, O., Toiviainen, P., Eerola, T.: A matlab toolbox for music information retrieval. In: Data Analysis, Machine Learning and Applications, pp. 261–268 (2008)
Beigi, H.: Audio source classification using speaker recognition techniques. World Wide Web (2011)
Mazaira-Fernandez, L.M., Álvarez-Marquina, A., Gómez-Vilda, P.: Improving speaker recognition by biometric voice deconstruction. Front. Bioeng. Biotechnol. 3 (2015)
Srivastava, S.: Weka: a tool for data preprocessing, classification, ensemble, clustering and association rule mining. Int. J. Comput. Appl. 88(10) (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Yasmin, G., Dhara, S., Mahindar, R., Das, A.K. (2019). Speaker Identification from Mixture of Speech and Non-speech Audio Signal. In: Nayak, J., Abraham, A., Krishna, B., Chandra Sekhar, G., Das, A. (eds) Soft Computing in Data Analytics . Advances in Intelligent Systems and Computing, vol 758. Springer, Singapore. https://doi.org/10.1007/978-981-13-0514-6_47
Download citation
DOI: https://doi.org/10.1007/978-981-13-0514-6_47
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0513-9
Online ISBN: 978-981-13-0514-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)