Statistical and Neural Classifiers: Application for Singer and Music Discrimination in Polyphonic Music Context

Ezzaidi, Hassan; Bahoura, Mohammed

doi:10.1007/978-3-642-13681-8_16

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6134)

Included in the following conference series:

International Conference on Image and Signal Processing

Abstract

This paper investigates the problem of identifying sections of singing voice and sections of instruments in polyphonic music. Three classification techniques are presented and compared: the Linde-Buzo-Gray (LBG) algorithm, Gaussian Mixture Models (GMM), and a feed-forward Multi-Layer Perceptron (MLP). All techniques are based on Mel-Frequency Cepstral Coefficients (MFCC), which are commonly used in the speech and speaker recognition domains. Each of the proposed approaches yields a decision every 125 ms. A large experimental data set is extracted from the RWC music genre database, covering various styles (68 pieces, 25 subcategories). Recognition scores are evaluated both on the data used in the training session and on data never seen by the proposed systems. The best results are obtained with the GMM (94% on training data and 80.5% on test data).
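To make the LBG-based approach more concrete, the following is a minimal sketch of vector-quantization classification: one codebook is trained per class with binary-splitting LBG, and a short segment is assigned to the class whose codebook gives the lowest average distortion. It is an illustration under stated assumptions, not the chapter's implementation. The abstract does not give the MFCC order, the codebook size, or the number of frames per 125 ms decision, so the values used here (13 coefficients, 4 codewords, 12 frames) are placeholders, and the Gaussian data merely stands in for real MFCC frames.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean_vec(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def nearest(vec, codebook):
    """Index of the codeword closest to vec."""
    return min(range(len(codebook)), key=lambda i: sq_dist(vec, codebook[i]))

def lbg(vectors, size, eps=0.01, iters=10):
    """Grow a codebook by binary splitting; size is assumed to be a power of two."""
    codebook = [mean_vec(vectors)]
    while len(codebook) < size:
        # Split every codeword into two copies shifted by +eps and -eps.
        codebook = [[c + s for c in cw] for cw in codebook for s in (eps, -eps)]
        for _ in range(iters):
            cells = [[] for _ in codebook]
            for v in vectors:
                cells[nearest(v, codebook)].append(v)
            # Re-estimate each codeword as its cell centroid; keep it if the cell is empty.
            codebook = [mean_vec(cell) if cell else cw
                        for cell, cw in zip(cells, codebook)]
    return codebook

def distortion(segment, codebook):
    """Average squared distance from each frame to its nearest codeword."""
    return sum(min(sq_dist(v, cw) for cw in codebook) for v in segment) / len(segment)

def classify(segment, codebooks):
    """Label whose codebook quantizes the segment with the least distortion."""
    return min(codebooks, key=lambda label: distortion(segment, codebooks[label]))

# Synthetic stand-ins for MFCC frames (hypothetical: 13 coefficients, unit variance).
rng = random.Random(0)

def frames(center, n, dim=13):
    return [[rng.gauss(center, 1.0) for _ in range(dim)] for _ in range(n)]

codebooks = {
    "voice": lbg(frames(+1.0, 200), size=4),
    "music": lbg(frames(-1.0, 200), size=4),
}

# One decision per segment; 12 frames per 125 ms segment is a placeholder.
print(classify(frames(+1.0, 12), codebooks))
print(classify(frames(-1.0, 12), codebooks))
```

The same segment-level decision rule carries over to a GMM by replacing the average distortion with the average negative log-likelihood of the frames under each class model.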

Author information

Authors and Affiliations

Département des Sciences Appliquées, Université du Québec à Chicoutimi, 550, boul. de l’Université, Chicoutimi, Qc, Canada, G7H 2B1
Hassan Ezzaidi
Département de Mathématiques, d’Informatique et de Génie, Université du Québec à Rimouski, 300, allée des Ursulines, Rimouski, Qc, Canada, G5L 3A1
Mohammed Bahoura

Editor information

Editors and Affiliations

Université de Caen Basse-Normandie GREYC UMR CNRS 6072, ENSICAEN, 14050, Caen, France
Abderrahim Elmoataz & Olivier Lezoray
Département de Mathématiques et d’informatique, Université du Québec à Trois-Rivières, C.P. 500, G9A 5H7, Trois-Rivières, Québec, Canada
Fathallah Nouboud
Faculté des Sciences, Université IbnZohr, Agadir, Morocco
Driss Mammass
Département d’Informatique et de Recherche Opérationnelle, Université de Montréal, H3C 3J7, Montréal, QC, Canada
Jean Meunier

About this paper

Cite this paper

Ezzaidi, H., Bahoura, M. (2010). Statistical and Neural Classifiers: Application for Singer and Music Discrimination in Polyphonic Music Context. In: Elmoataz, A., Lezoray, O., Nouboud, F., Mammass, D., Meunier, J. (eds) Image and Signal Processing. ICISP 2010. Lecture Notes in Computer Science, vol 6134. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13681-8_16

DOI: https://doi.org/10.1007/978-3-642-13681-8_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13680-1
Online ISBN: 978-3-642-13681-8
eBook Packages: Computer Science, Computer Science (R0)
