Music Genre Classification Using a Gradient-Based Local Texture Descriptor
With the increasing popularity and availability of online music databases that store vast collections of music, automated classification of music genre has attracted significant attention for the management of such large-scale databases. This paper presents a new music genre classification method that utilizes gradient-based texture analysis of the spectrograms constructed from the audio signals. We propose to use gradient directional pattern (GDP)—a robust local texture descriptor that exploits the gradient directional information to encode the local texture properties of an image. The proposed method first computes spectrograms from the audio signals and then applies the GDP operator to construct the feature descriptors that represent micro-level texture details of the spectrograms. We use a support vector machine (SVM) for the classification task. The effectiveness of the proposed method is evaluated using the GTZAN genre collection music database. Our experiments show promising results for the proposed GDP-based spectrogram texture analysis, as compared against some other existing music genre classification methods.
KeywordsMusic genre classification Local texture analysis Spectrogram Gradient directional pattern (GDP)
The authors would like to thank NSERC Discovery Grant Project 1028463, NSERC Engage, AITF, and MITACS Accelerate for partial support of this project.
- 2.Ahmed, F., Kabir, M.H.: Directional ternary pattern (dtp) for facial expression recognition. In: IEEE International Conference on Consumer Electronics, pp. 265–266 (2012)Google Scholar
- 5.Costa, Y., Oliveira, L., Koerich, A., Gouyon, F.: Music genre recognition using spectrograms. In: International Conference on Systems, Signals and Image Processing, pp. 151–154 (2011)Google Scholar
- 7.Dannenberg, R., Thom, B., Watson, D.: A machine learning approach to musical style recognition. In: International Computer Music Conference (1997)Google Scholar
- 8.Ezzaidi, H., Rouat, J.: Automatic musical genre classification using divergence and average information measures. In: Research report of the world academy of science, engineering and technology (2006)Google Scholar
- 11.Li, T., M, M.O., Li, Q.: A comparative study on content-based music genre classification. In: international ACM SI-GIR conference on research and development in information retrieval, pp. 282–289 (2003)Google Scholar
- 12.Lidy, T., Rauber, A.: Evaluation of feature extractors and psychoacoustic transformations for music genre classification. In: International Conference on Music Information Retrieval, pp. 71–80 (2005)Google Scholar
- 14.McKay, C., Fujinaga, I.: Musical genre classification: is it worth pursuing and how can it be improved? In: International Conference on Music Information Retrieval, pp. 101–106 (2006)Google Scholar
- 15.Neammalai, P., Phimoltares, S., Lursinsap, C.: Speech and music classification using hybrid form of spectrogram and fourier transformation. In: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, pp. 1–6 (2014)Google Scholar
- 18.Tan, X., Triggs, B.: Enhanced local texture feature sets for face recognition under difficult lighting conditions. In: IEEE International Workshop on Analysis and Modeling of Faces and Gestures, LNCS vol. 4778, pp. 168–182 (2007)Google Scholar
- 20.Wu, H., Zhang, M.: Gabor-lbp features and combined classifiers for music genre classification. In: International Conference on Computer and Information Application, pp. 419–422 (2012)Google Scholar
- 21.Zhao, S., Gao, Y., Zhang, B.: Sobel-lbp. In: IEEE International Conference on Image Processing, pp. 2144–2147 (2008)Google Scholar
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 2.5 International License (http://creativecommons.org/licenses/by-nc/2.5/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.