Singer Identification Using Time-Frequency Audio Feature

Doungpaisan, Pafan

doi:10.1007/978-3-642-21090-7_57

Pafan Doungpaisan²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6676))

Included in the following conference series:

International Symposium on Neural Networks

2355 Accesses
2 Citations

Abstract

Singer identification is a difficult topic in music information Retriveal research area. Because the background instrumental accompaniment in audio music is regarded as noise source that has to reduce a performance.

This paper proposes a singer identification algorithm thai is able to automatically identify a singer in an audio music signal with background music by using Time-Frequency audio feature. The main idea is used a spectrogram to able effective Time-Frequency feature and used as the input for classification. The proposed technique is test with 20 different singer. Sereval classification technique are compared,such as Feed-Forward Neural Network, k-Nearest Neighbor (kNN) and Minimum least square linear classifier(Fisher). The experimental result on singer identification using a spectrogram with Feed-Forward Neural Networkand and k-Nearest Neighbor (kNN) can effectively identify the singer in music signal with background music more than 92%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Singer Identification Based on Artificial Neural Network

Singer identification using perceptual features and cepstral coefficients of an audio signal from Indian video songs

Article Open access 25 June 2015

Singer Identification Using MFCC and LPC Coefficients from Indian Video Songs

References

Whitman, B., Flake, G., Lawrence, S.: Artist detection in music with Minnow match. In: Proceedings of the 2001 IEEE Workshop on Neural Networks for Signal Processing, Falmouth, MA, pp. 559–568 (2001)
Google Scholar
Berenzweig, A., Ellis, D., Lawrence, S.: Using voice segments to improve artist classification of music. In: AES 22nd International Conference, Espoo, Finland (2002)
Google Scholar
Makeyev, O., Sazonov, E., Schuckers, S., Melanson, E., Neuman, M.: Limited receptive area neural classifier for recognition of swallowing sounds using short-time Fourier transform. In: Proc. International Joint Conference on Neural Networks IJCNN 2007, Orlando, USA, August 12-17, pp. 1417.1–1417.6 (2007)
Google Scholar
Lin, C.-C., Chen, S.-H., Truong, T.-K., Chang, Y.: Audio Classification and Categorization Based on Wavelets and Support Vector Machine. IEEE Transactions on Speech and Audio Processing 13(5), 644–651 (2005)
Article Google Scholar
Esmaili, S., Krishnan, S., Raahemifar, K.: Content Based Audio Classification and Retrieval Using Joint Time-Frequency Analysis. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), vol. 5, pp. 665–668 (May 2004)
Google Scholar
Wang, J.-C., Lee, H.-P., Wang, J.-F., Lin, C.-B.: Robust environmental sound recognition for home automation. IEEE Transactions on Automation Science and Engineering 5(1), 25–31 (2008)
Article Google Scholar
Yoshii, K., Goto, M., Okuno, H.G.: Rum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression. IEEE Transactions on Audio, Speech, and Language Processing 15(1), 333–345 (2007)
Article Google Scholar
Toyoda, Y., Huang, J., Ding, S., Liu, Y.: Environmental sound recognition by the instantaneous spectrum combined with the time pattern of power. In: Proceedings of the 2nd IASTED International Conference on Neural Networks and Computational Intelligence, pp. 169–172 (2004)
Google Scholar
Makeyev, O., Sazonov, E., Schuckers, S., Melanson, E., Neuman, M.: Limited receptive area neural classifier for recognition of swallowing sounds using short-time Fourier transform. In: International Joint Conference on Neural Networks, IJCNN 2007, Orlando, USA, August 12-17, pp. 1417.1–1417.6 (2007a)
Google Scholar
Makeyev, O., Sazonov, E., Schuckers, S., Lopez-Meyer, P., Melanson, E., Neuman, M.: Limited receptive area neural classifier for recognition of swallowing sounds using continuous wavelet transform. In: 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS 2007, pp. 3128–3131 (August 2007)
Google Scholar
Ajmera, J., McCowan, I., Bourlard, H.: Speech/music segmentation using entropy and dynamism features in a HMM classification framework. Speech Communication 40(3), 351–363 (2003)
Article Google Scholar
Rajapakse, M., Wyse, L.: Generic audio classification using a hybrid model based on GMMS and HMMS. Proceedings of the IEEE, 1550–1555 (2005)
Google Scholar
Toyoda, Y., Huang, J., Ding, S., Liu, Y.: Environmental Sound Recognition by Multilayered Neural Networks. In: CIT 2004, pp. 123–127 (2004)
Google Scholar
Georgoulas, G., Georgopoulos, V.C., Stylios, C.D.: Speech Sound Classification and Detection of Articulation Disorders with Support Vector Machines and Wavelets. In: 28th IEEE EMBS Annual International Conference, New York City, New York, USA, August 30-September 3 (2006)
Google Scholar
Lin, C.-C., Chen, S.-H., Truong, T.-K., Chang, Y.: Audio classification and categorization based on wavelets and support vector machine. IEEE Transactions on Speech and Audio Processing 13(5), 644–651 (2005)
Article Google Scholar
Kim, Y.E., Whitman, B.: Singer identification in popular music recordings using voice coding features. In: Proceedings of the 3rd International Conference on Music Information Retrieval, Paris, France (2002)
Google Scholar
Fujihara, H., Goto, M., Kitahara, T., Okuno, H.G.: A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval. IEEE Transactions on Audio, Speech, and Language Processing 18(3), 638–648 (2010)
Article Google Scholar
Demuth, H., Beale, M.: Neural Network Toolbox for Use with Matlab: User’s Guide (version 4), p. 2000. The MathWorks Inc.
Google Scholar
Peerapol, K.: Singing Voice Recognition based on Matching of Spectrogram Pattern. In: Proceedings of International Joint Conference on Neural Networks, Atlanta, Georgia, USA, June 14-19, pp. 978–981 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Technology, Faculty of Industrial Technology and Management, King Mongkut’s University of Technology North Bangkok, 1518 Pibulsongkram Road, Bangsue, Bangkok, 10800, Thailand
Pafan Doungpaisan

Authors

Pafan Doungpaisan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Automation, Key Laboratory of Complex Systems and Intelligence Science, Chinese Academy of Sciences, 100190, Beijing, China
Derong Liu
College of Information Science and Engineering, Northeastern University, 110004, Shenyang, Liaoing, China
Huaguang Zhang
Department of Electrical and Computer Engineering, University of Cyprus, 75 Kallipoleos Avenue, 1678, Nicosia, Cyprus
Marios Polycarpou
Dipartimento di Elettronica, Politecnico di Milano, Piazza L. da Vinci 32, 20133, Milano, Italy
Cesare Alippi
Deptartment of Electrical, Computer and Biomedical Engineering, University of Rhode Island, 02881, Kingston, RI, USA
Haibo He

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Doungpaisan, P. (2011). Singer Identification Using Time-Frequency Audio Feature. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds) Advances in Neural Networks – ISNN 2011. ISNN 2011. Lecture Notes in Computer Science, vol 6676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21090-7_57

Download citation

DOI: https://doi.org/10.1007/978-3-642-21090-7_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21089-1
Online ISBN: 978-3-642-21090-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Singer Identification Using Time-Frequency Audio Feature

Abstract

Access this chapter

Preview

Similar content being viewed by others

Singer Identification Based on Artificial Neural Network

Singer identification using perceptual features and cepstral coefficients of an audio signal from Indian video songs

Singer Identification Using MFCC and LPC Coefficients from Indian Video Songs

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Singer Identification Using Time-Frequency Audio Feature

Abstract

Access this chapter

Preview

Similar content being viewed by others

Singer Identification Based on Artificial Neural Network

Singer identification using perceptual features and cepstral coefficients of an audio signal from Indian video songs

Singer Identification Using MFCC and LPC Coefficients from Indian Video Songs

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation