Skip to main content

Singer Identification Using Time-Frequency Audio Feature

  • Conference paper
Advances in Neural Networks – ISNN 2011 (ISNN 2011)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6676))

Included in the following conference series:

Abstract

Singer identification is a difficult topic in music information Retriveal research area. Because the background instrumental accompaniment in audio music is regarded as noise source that has to reduce a performance.

This paper proposes a singer identification algorithm thai is able to automatically identify a singer in an audio music signal with background music by using Time-Frequency audio feature. The main idea is used a spectrogram to able effective Time-Frequency feature and used as the input for classification. The proposed technique is test with 20 different singer. Sereval classification technique are compared,such as Feed-Forward Neural Network, k-Nearest Neighbor (kNN) and Minimum least square linear classifier(Fisher). The experimental result on singer identification using a spectrogram with Feed-Forward Neural Networkand and k-Nearest Neighbor (kNN) can effectively identify the singer in music signal with background music more than 92%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Whitman, B., Flake, G., Lawrence, S.: Artist detection in music with Minnow match. In: Proceedings of the 2001 IEEE Workshop on Neural Networks for Signal Processing, Falmouth, MA, pp. 559–568 (2001)

    Google Scholar 

  2. Berenzweig, A., Ellis, D., Lawrence, S.: Using voice segments to improve artist classification of music. In: AES 22nd International Conference, Espoo, Finland (2002)

    Google Scholar 

  3. Makeyev, O., Sazonov, E., Schuckers, S., Melanson, E., Neuman, M.: Limited receptive area neural classifier for recognition of swallowing sounds using short-time Fourier transform. In: Proc. International Joint Conference on Neural Networks IJCNN 2007, Orlando, USA, August 12-17, pp. 1417.1–1417.6 (2007)

    Google Scholar 

  4. Lin, C.-C., Chen, S.-H., Truong, T.-K., Chang, Y.: Audio Classification and Categorization Based on Wavelets and Support Vector Machine. IEEE Transactions on Speech and Audio Processing 13(5), 644–651 (2005)

    Article  Google Scholar 

  5. Esmaili, S., Krishnan, S., Raahemifar, K.: Content Based Audio Classification and Retrieval Using Joint Time-Frequency Analysis. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), vol. 5, pp. 665–668 (May 2004)

    Google Scholar 

  6. Wang, J.-C., Lee, H.-P., Wang, J.-F., Lin, C.-B.: Robust environmental sound recognition for home automation. IEEE Transactions on Automation Science and Engineering 5(1), 25–31 (2008)

    Article  Google Scholar 

  7. Yoshii, K., Goto, M., Okuno, H.G.: Rum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression. IEEE Transactions on Audio, Speech, and Language Processing 15(1), 333–345 (2007)

    Article  Google Scholar 

  8. Toyoda, Y., Huang, J., Ding, S., Liu, Y.: Environmental sound recognition by the instantaneous spectrum combined with the time pattern of power. In: Proceedings of the 2nd IASTED International Conference on Neural Networks and Computational Intelligence, pp. 169–172 (2004)

    Google Scholar 

  9. Makeyev, O., Sazonov, E., Schuckers, S., Melanson, E., Neuman, M.: Limited receptive area neural classifier for recognition of swallowing sounds using short-time Fourier transform. In: International Joint Conference on Neural Networks, IJCNN 2007, Orlando, USA, August 12-17, pp. 1417.1–1417.6 (2007a)

    Google Scholar 

  10. Makeyev, O., Sazonov, E., Schuckers, S., Lopez-Meyer, P., Melanson, E., Neuman, M.: Limited receptive area neural classifier for recognition of swallowing sounds using continuous wavelet transform. In: 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS 2007, pp. 3128–3131 (August 2007)

    Google Scholar 

  11. Ajmera, J., McCowan, I., Bourlard, H.: Speech/music segmentation using entropy and dynamism features in a HMM classification framework. Speech Communication 40(3), 351–363 (2003)

    Article  Google Scholar 

  12. Rajapakse, M., Wyse, L.: Generic audio classification using a hybrid model based on GMMS and HMMS. Proceedings of the IEEE, 1550–1555 (2005)

    Google Scholar 

  13. Toyoda, Y., Huang, J., Ding, S., Liu, Y.: Environmental Sound Recognition by Multilayered Neural Networks. In: CIT 2004, pp. 123–127 (2004)

    Google Scholar 

  14. Georgoulas, G., Georgopoulos, V.C., Stylios, C.D.: Speech Sound Classification and Detection of Articulation Disorders with Support Vector Machines and Wavelets. In: 28th IEEE EMBS Annual International Conference, New York City, New York, USA, August 30-September 3 (2006)

    Google Scholar 

  15. Lin, C.-C., Chen, S.-H., Truong, T.-K., Chang, Y.: Audio classification and categorization based on wavelets and support vector machine. IEEE Transactions on Speech and Audio Processing 13(5), 644–651 (2005)

    Article  Google Scholar 

  16. Kim, Y.E., Whitman, B.: Singer identification in popular music recordings using voice coding features. In: Proceedings of the 3rd International Conference on Music Information Retrieval, Paris, France (2002)

    Google Scholar 

  17. Fujihara, H., Goto, M., Kitahara, T., Okuno, H.G.: A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval. IEEE Transactions on Audio, Speech, and Language Processing 18(3), 638–648 (2010)

    Article  Google Scholar 

  18. Demuth, H., Beale, M.: Neural Network Toolbox for Use with Matlab: User’s Guide (version 4), p. 2000. The MathWorks Inc.

    Google Scholar 

  19. Peerapol, K.: Singing Voice Recognition based on Matching of Spectrogram Pattern. In: Proceedings of International Joint Conference on Neural Networks, Atlanta, Georgia, USA, June 14-19, pp. 978–981 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Doungpaisan, P. (2011). Singer Identification Using Time-Frequency Audio Feature. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds) Advances in Neural Networks – ISNN 2011. ISNN 2011. Lecture Notes in Computer Science, vol 6676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21090-7_57

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-21090-7_57

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-21089-1

  • Online ISBN: 978-3-642-21090-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics