Abstract
Singer identification is one of the challenging tasks in Music information retrieval (MIR) category. Music of India generates 4-5% of net revenue for a movie. Indian video songs include variety of singers. The research presented in this paper is to identify singer using MFCC and LPC coefficients from Indian video songs. Initially Audio portion is extracted from Indian video songs. Audio portion is divided into segments. For each segment, 13 Mel-frequency cepstral coefficients (MFCC) and 13 linear predictive coding (LPC) coefficients are computed. Principal component analysis method is used to reduce the dimensionality of segments. Singer models are trained using Naive bayes classifier and back propagation algorithm using neural network. The proposed approach is tested using different combinations of coefficients with male and female Indian singers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Regnier, L., Peeters, G.: Singer verification: singer model vs. song model. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 437–440 (2012)
Mammone, R., Zhang, X., Ramachandran, R.P.: Robust speaker recognition: A feature-based approach. IEEE Signal Processing Magazine 13, 58–71 (1996)
Logan, B.: Mel frequency cepstral coefficients for music modeling. In: International Symposium Music Information Retrieval (2000)
Doungpaisan, P.: Singer identification using time-frequency audio feature. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds.) ISNN 2011, Part II. LNCS, vol. 6676, pp. 486–495. Springer, Heidelberg (2011)
Maazouzi, F., Bahi, H.: Use of Gaussian Mixture Models and Vector Quantization for Singing Voice Classification in Commercial Music Productions. In: 10th International Symposium on Programming and Systems (ISPS), pp. 116–121 (2011)
Bartsch, M., Wakefield, G.: Singing voice identification using spectral envelope estimation. IEEE Transactions on Speech and Audio Processing 12, 100–109 (2004)
Deshmukh, S., Bhirud, S.G.: A Hybrid Selection Method of Audio Descriptors for Singer Identification in North Indian Classical Music. In: Fifth International Conference on Emerging Trends in Engineering and Technology, pp. 224–227 (2012)
Jackson, L.B.: Digital Filters and Signal Processing, 2nd edn., pp. 255–257. Kluwer Academic Publishers, Boston (1989)
Maniya, H., Hasan, M.: Comparative study of naïve bayes classifier and KNN for Tuberculosis. In: International Conference on Web Services Computing (2011)
Saduf, Wani, M.: Comparative study of back propagation learning algorithms for neural networks. International Journal of Advanced Research in Computer Science and Software Engineering 3 (2013)
Meijer, R., Goeman, J.: Efficient approximate k-fold and leave-one-out cross-validation for ridge regression. Biometrical Journal 55(2) (2013)
RodrÃguez, J., Pérez, A., Lozano, J.: Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(2) (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Ratanpara, T., Patel, N. (2015). Singer Identification Using MFCC and LPC Coefficients from Indian Video Songs. In: Satapathy, S., Govardhan, A., Raju, K., Mandal, J. (eds) Emerging ICT for Bridging the Future - Proceedings of the 49th Annual Convention of the Computer Society of India (CSI) Volume 1. Advances in Intelligent Systems and Computing, vol 337. Springer, Cham. https://doi.org/10.1007/978-3-319-13728-5_31
Download citation
DOI: https://doi.org/10.1007/978-3-319-13728-5_31
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13727-8
Online ISBN: 978-3-319-13728-5
eBook Packages: EngineeringEngineering (R0)