Singer Identification Using MFCC and LPC Coefficients from Indian Video Songs

Ratanpara, Tushar; Patel, Narendra

doi:10.1007/978-3-319-13728-5_31

Tushar Ratanpara⁶ &
Narendra Patel⁶

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 337))

2724 Accesses
5 Citations

Abstract

Singer identification is one of the challenging tasks in Music information retrieval (MIR) category. Music of India generates 4-5% of net revenue for a movie. Indian video songs include variety of singers. The research presented in this paper is to identify singer using MFCC and LPC coefficients from Indian video songs. Initially Audio portion is extracted from Indian video songs. Audio portion is divided into segments. For each segment, 13 Mel-frequency cepstral coefficients (MFCC) and 13 linear predictive coding (LPC) coefficients are computed. Principal component analysis method is used to reduce the dimensionality of segments. Singer models are trained using Naive bayes classifier and back propagation algorithm using neural network. The proposed approach is tested using different combinations of coefficients with male and female Indian singers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Regnier, L., Peeters, G.: Singer verification: singer model vs. song model. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 437–440 (2012)
Google Scholar
Mammone, R., Zhang, X., Ramachandran, R.P.: Robust speaker recognition: A feature-based approach. IEEE Signal Processing Magazine 13, 58–71 (1996)
Article Google Scholar
Logan, B.: Mel frequency cepstral coefficients for music modeling. In: International Symposium Music Information Retrieval (2000)
Google Scholar
Doungpaisan, P.: Singer identification using time-frequency audio feature. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds.) ISNN 2011, Part II. LNCS, vol. 6676, pp. 486–495. Springer, Heidelberg (2011)
Chapter Google Scholar
Maazouzi, F., Bahi, H.: Use of Gaussian Mixture Models and Vector Quantization for Singing Voice Classification in Commercial Music Productions. In: 10th International Symposium on Programming and Systems (ISPS), pp. 116–121 (2011)
Google Scholar
Bartsch, M., Wakefield, G.: Singing voice identification using spectral envelope estimation. IEEE Transactions on Speech and Audio Processing 12, 100–109 (2004)
Article Google Scholar
Deshmukh, S., Bhirud, S.G.: A Hybrid Selection Method of Audio Descriptors for Singer Identification in North Indian Classical Music. In: Fifth International Conference on Emerging Trends in Engineering and Technology, pp. 224–227 (2012)
Google Scholar
Jackson, L.B.: Digital Filters and Signal Processing, 2nd edn., pp. 255–257. Kluwer Academic Publishers, Boston (1989)
Book Google Scholar
Maniya, H., Hasan, M.: Comparative study of naïve bayes classifier and KNN for Tuberculosis. In: International Conference on Web Services Computing (2011)
Google Scholar
Saduf, Wani, M.: Comparative study of back propagation learning algorithms for neural networks. International Journal of Advanced Research in Computer Science and Software Engineering 3 (2013)
Google Scholar
Meijer, R., Goeman, J.: Efficient approximate k-fold and leave-one-out cross-validation for ridge regression. Biometrical Journal 55(2) (2013)
Google Scholar
Rodríguez, J., Pérez, A., Lozano, J.: Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(2) (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

C. U. Shah University, Wadhwan City, Gujarat, India
Tushar Ratanpara & Narendra Patel

Authors

Tushar Ratanpara
View author publications
You can also search for this author in PubMed Google Scholar
Narendra Patel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tushar Ratanpara .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Anil Neerukonda Institute of Technology and Sciences, Vishakapatnam, India
Suresh Chandra Satapathy
School of Information Technology, Jawaharlal Nehru Technological University Hyderabad, Hyderabad, India
A. Govardhan
Department of CSE, CMR Technical Campus, Hyderabad, India
K. Srujan Raju
Department of Computer Science & Engineering, Faculty of Engg., Tech. & Management, University of Kalyani, Kalyani, West Bengal, India
J. K. Mandal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ratanpara, T., Patel, N. (2015). Singer Identification Using MFCC and LPC Coefficients from Indian Video Songs. In: Satapathy, S., Govardhan, A., Raju, K., Mandal, J. (eds) Emerging ICT for Bridging the Future - Proceedings of the 49th Annual Convention of the Computer Society of India (CSI) Volume 1. Advances in Intelligent Systems and Computing, vol 337. Springer, Cham. https://doi.org/10.1007/978-3-319-13728-5_31

Download citation

DOI: https://doi.org/10.1007/978-3-319-13728-5_31
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13727-8
Online ISBN: 978-3-319-13728-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics