Abstract
Singer identification is a difficult topic in music information Retriveal research area. Because the background instrumental accompaniment in audio music is regarded as noise source that has to reduce a performance.
This paper proposes a singer identification algorithm thai is able to automatically identify a singer in an audio music signal with background music by using Time-Frequency audio feature. The main idea is used a spectrogram to able effective Time-Frequency feature and used as the input for classification. The proposed technique is test with 20 different singer. Sereval classification technique are compared,such as Feed-Forward Neural Network, k-Nearest Neighbor (kNN) and Minimum least square linear classifier(Fisher). The experimental result on singer identification using a spectrogram with Feed-Forward Neural Networkand and k-Nearest Neighbor (kNN) can effectively identify the singer in music signal with background music more than 92%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Whitman, B., Flake, G., Lawrence, S.: Artist detection in music with Minnow match. In: Proceedings of the 2001 IEEE Workshop on Neural Networks for Signal Processing, Falmouth, MA, pp. 559–568 (2001)
Berenzweig, A., Ellis, D., Lawrence, S.: Using voice segments to improve artist classification of music. In: AES 22nd International Conference, Espoo, Finland (2002)
Makeyev, O., Sazonov, E., Schuckers, S., Melanson, E., Neuman, M.: Limited receptive area neural classifier for recognition of swallowing sounds using short-time Fourier transform. In: Proc. International Joint Conference on Neural Networks IJCNN 2007, Orlando, USA, August 12-17, pp. 1417.1–1417.6 (2007)
Lin, C.-C., Chen, S.-H., Truong, T.-K., Chang, Y.: Audio Classification and Categorization Based on Wavelets and Support Vector Machine. IEEE Transactions on Speech and Audio Processing 13(5), 644–651 (2005)
Esmaili, S., Krishnan, S., Raahemifar, K.: Content Based Audio Classification and Retrieval Using Joint Time-Frequency Analysis. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), vol. 5, pp. 665–668 (May 2004)
Wang, J.-C., Lee, H.-P., Wang, J.-F., Lin, C.-B.: Robust environmental sound recognition for home automation. IEEE Transactions on Automation Science and Engineering 5(1), 25–31 (2008)
Yoshii, K., Goto, M., Okuno, H.G.: Rum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression. IEEE Transactions on Audio, Speech, and Language Processing 15(1), 333–345 (2007)
Toyoda, Y., Huang, J., Ding, S., Liu, Y.: Environmental sound recognition by the instantaneous spectrum combined with the time pattern of power. In: Proceedings of the 2nd IASTED International Conference on Neural Networks and Computational Intelligence, pp. 169–172 (2004)
Makeyev, O., Sazonov, E., Schuckers, S., Melanson, E., Neuman, M.: Limited receptive area neural classifier for recognition of swallowing sounds using short-time Fourier transform. In: International Joint Conference on Neural Networks, IJCNN 2007, Orlando, USA, August 12-17, pp. 1417.1–1417.6 (2007a)
Makeyev, O., Sazonov, E., Schuckers, S., Lopez-Meyer, P., Melanson, E., Neuman, M.: Limited receptive area neural classifier for recognition of swallowing sounds using continuous wavelet transform. In: 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS 2007, pp. 3128–3131 (August 2007)
Ajmera, J., McCowan, I., Bourlard, H.: Speech/music segmentation using entropy and dynamism features in a HMM classification framework. Speech Communication 40(3), 351–363 (2003)
Rajapakse, M., Wyse, L.: Generic audio classification using a hybrid model based on GMMS and HMMS. Proceedings of the IEEE, 1550–1555 (2005)
Toyoda, Y., Huang, J., Ding, S., Liu, Y.: Environmental Sound Recognition by Multilayered Neural Networks. In: CIT 2004, pp. 123–127 (2004)
Georgoulas, G., Georgopoulos, V.C., Stylios, C.D.: Speech Sound Classification and Detection of Articulation Disorders with Support Vector Machines and Wavelets. In: 28th IEEE EMBS Annual International Conference, New York City, New York, USA, August 30-September 3 (2006)
Lin, C.-C., Chen, S.-H., Truong, T.-K., Chang, Y.: Audio classification and categorization based on wavelets and support vector machine. IEEE Transactions on Speech and Audio Processing 13(5), 644–651 (2005)
Kim, Y.E., Whitman, B.: Singer identification in popular music recordings using voice coding features. In: Proceedings of the 3rd International Conference on Music Information Retrieval, Paris, France (2002)
Fujihara, H., Goto, M., Kitahara, T., Okuno, H.G.: A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval. IEEE Transactions on Audio, Speech, and Language Processing 18(3), 638–648 (2010)
Demuth, H., Beale, M.: Neural Network Toolbox for Use with Matlab: User’s Guide (version 4), p. 2000. The MathWorks Inc.
Peerapol, K.: Singing Voice Recognition based on Matching of Spectrogram Pattern. In: Proceedings of International Joint Conference on Neural Networks, Atlanta, Georgia, USA, June 14-19, pp. 978–981 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Doungpaisan, P. (2011). Singer Identification Using Time-Frequency Audio Feature. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds) Advances in Neural Networks – ISNN 2011. ISNN 2011. Lecture Notes in Computer Science, vol 6676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21090-7_57
Download citation
DOI: https://doi.org/10.1007/978-3-642-21090-7_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21089-1
Online ISBN: 978-3-642-21090-7
eBook Packages: Computer ScienceComputer Science (R0)