GMM Based Indexing and Retrieval of Music Using MFCC and MPEG-7 Features

Thiruvengatanadhan, R.; Dhanalakshmi, P.; Palanivel, S.

doi:10.1007/978-3-319-13728-5_41

R. Thiruvengatanadhan⁶,
P. Dhanalakshmi⁶ &
S. Palanivel⁶

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 337))

2709 Accesses

Abstract

Audio which includes voice, music, and various kinds of environmental sounds, is an important type of media, and also a significant part of video. The digital music databases in place these days, people begin to realize the importance of effectively managing music databases relying on music content analysis. The goal of music indexing and retrieval system is to provide the user with capabilities to index and retrieve the music data in an efficient manner. For efficient music retrieval, some sort of music similarity measure is desirable. In this paper, we propose a method for indexing and retrieval of the classified music using Mel-Frequency Cepstral Coefficients (MFCC) and MPEG-7 features. Music clip extraction, feature extraction, creation of an index and retrieval of the query clip are the major issues in automatic audio indexing and retrieval. Indexing is done for all the music audio clips using Gaussian mixture model (GMM) models, based on the features extracted. For retrieval, the probability that the query feature vector belongs to each of the Gaussian is computed. The average Probability density function is computed for each of the Gaussians and the retrieval is based on the highest probability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Grosche, P.M.: Signal Processing Methods for Beat Tracking, Music Segmentation and Audio Retrieval. Thesis Universität des Saarlandes (2012)
Google Scholar
Kumar, R.C.P., Chandy, D.A.: Audio retrieval using timbral feature. In: 2013 International Conference on Emerging Trends in Computing, Communication and Nanotechnology (ICE-CCN), March 25-26, pp. 222–226 (2013), doi:10.1109/ICE-CCN.2013.6528497
Google Scholar
Nagavi, T.C., Anusha, S.B., Monisha, P., Poornima, S.P.: Content based audio retrieval with MFCC feature extraction, clustering and sort-merge techniques Computing. In: 2013 Fourth International Conference on Communications and Networking Technologies (ICCCNT), July 4-6, pp. 1–6 (2013)
Google Scholar
Kinnunen, T., Saeidi, R., Sedlak, F., Lee, K.A., Sandberg, J., Hansson-Sandsten, M., Li, H.: Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification. IEEE Transactions on Audio, Speech, and Language Processing 20(7), 1990–2001 (2012)
Article Google Scholar
Krishna Kishore, K.V., Krishna Satish, P.: Emotion recognition in speech using MFCC and wavelet features. In: 2013 IEEE 3rd International Advance Computing Conference (IACC), February 22-23, pp. 842–847 (2013)
Google Scholar
Zanoni, M., Ciminieri, D., Sarti, A., Tubaro, S.: Searching for dominant high-level features for Music Information Retrieval. In: 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO), August 27-31, pp. 2025–2029 (2012)
Google Scholar
Tang, H., Chu, S.M., Hasegawa-Johnson, M., Huang, T.S.: Partially Supervised Speaker Clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 34(5), 959–971 (2012)
Article Google Scholar
Kim, H.-G., Moreau, N., Sikora, T.: MPEG-7 Audio and Beyond Audio Content Indexing andRetrieval. John wiley & sons Ltd (2004)
Google Scholar
Lidy, T.: Evaluation of New Audio Features and Their Utilization in Novel Music Retrieval Applications, Thesis (December 2006)
Google Scholar
Raś, Z.W., Zhang, X.: Multi-hierarchical Music Automatic Indexing and Retrieval System (2008)
Google Scholar
Goto, M.: A real-time music-scene description system: Predominant-f0 estimation for detecting melody and bass lines in real-world audio signals. Speech Communication 43, 311–329 (2004)
Article Google Scholar
Redner, R.A., Walker, H.F.: Mixture densities, maximum likelihood and the EM algorithm. SIAM Review 26, 195–239 (1984)
Article MATH MathSciNet Google Scholar
Chechik, G., Ie, E., Rehn, M.: Large-Scale Content-Based Audio Retrieval from Text Queries. In: MIR 2008, October 30–31, pp. 105–112 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science and Engineering, Annamalai University, Chidambaram, Tamil Nadu, India
R. Thiruvengatanadhan, P. Dhanalakshmi & S. Palanivel

Authors

R. Thiruvengatanadhan
View author publications
You can also search for this author in PubMed Google Scholar
P. Dhanalakshmi
View author publications
You can also search for this author in PubMed Google Scholar
S. Palanivel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Thiruvengatanadhan .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Anil Neerukonda Institute of Technology and Sciences, Vishakapatnam, India
Suresh Chandra Satapathy
School of Information Technology, Jawaharlal Nehru Technological University Hyderabad, Hyderabad, India
A. Govardhan
Department of CSE, CMR Technical Campus, Hyderabad, India
K. Srujan Raju
Department of Computer Science & Engineering, Faculty of Engg., Tech. & Management, University of Kalyani, Kalyani, West Bengal, India
J. K. Mandal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Thiruvengatanadhan, R., Dhanalakshmi, P., Palanivel, S. (2015). GMM Based Indexing and Retrieval of Music Using MFCC and MPEG-7 Features. In: Satapathy, S., Govardhan, A., Raju, K., Mandal, J. (eds) Emerging ICT for Bridging the Future - Proceedings of the 49th Annual Convention of the Computer Society of India (CSI) Volume 1. Advances in Intelligent Systems and Computing, vol 337. Springer, Cham. https://doi.org/10.1007/978-3-319-13728-5_41

Download citation

DOI: https://doi.org/10.1007/978-3-319-13728-5_41
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13727-8
Online ISBN: 978-3-319-13728-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics