Abstract
Audio which includes voice, music, and various kinds of environmental sounds, is an important type of media, and also a significant part of video. The digital music databases in place these days, people begin to realize the importance of effectively managing music databases relying on music content analysis. The goal of music indexing and retrieval system is to provide the user with capabilities to index and retrieve the music data in an efficient manner. For efficient music retrieval, some sort of music similarity measure is desirable. In this paper, we propose a method for indexing and retrieval of the classified music using Mel-Frequency Cepstral Coefficients (MFCC) and MPEG-7 features. Music clip extraction, feature extraction, creation of an index and retrieval of the query clip are the major issues in automatic audio indexing and retrieval. Indexing is done for all the music audio clips using Gaussian mixture model (GMM) models, based on the features extracted. For retrieval, the probability that the query feature vector belongs to each of the Gaussian is computed. The average Probability density function is computed for each of the Gaussians and the retrieval is based on the highest probability.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Grosche, P.M.: Signal Processing Methods for Beat Tracking, Music Segmentation and Audio Retrieval. Thesis Universität des Saarlandes (2012)
Kumar, R.C.P., Chandy, D.A.: Audio retrieval using timbral feature. In: 2013 International Conference on Emerging Trends in Computing, Communication and Nanotechnology (ICE-CCN), March 25-26, pp. 222–226 (2013), doi:10.1109/ICE-CCN.2013.6528497
Nagavi, T.C., Anusha, S.B., Monisha, P., Poornima, S.P.: Content based audio retrieval with MFCC feature extraction, clustering and sort-merge techniques Computing. In: 2013 Fourth International Conference on Communications and Networking Technologies (ICCCNT), July 4-6, pp. 1–6 (2013)
Kinnunen, T., Saeidi, R., Sedlak, F., Lee, K.A., Sandberg, J., Hansson-Sandsten, M., Li, H.: Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification. IEEE Transactions on Audio, Speech, and Language Processing 20(7), 1990–2001 (2012)
Krishna Kishore, K.V., Krishna Satish, P.: Emotion recognition in speech using MFCC and wavelet features. In: 2013 IEEE 3rd International Advance Computing Conference (IACC), February 22-23, pp. 842–847 (2013)
Zanoni, M., Ciminieri, D., Sarti, A., Tubaro, S.: Searching for dominant high-level features for Music Information Retrieval. In: 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO), August 27-31, pp. 2025–2029 (2012)
Tang, H., Chu, S.M., Hasegawa-Johnson, M., Huang, T.S.: Partially Supervised Speaker Clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 34(5), 959–971 (2012)
Kim, H.-G., Moreau, N., Sikora, T.: MPEG-7 Audio and Beyond Audio Content Indexing andRetrieval. John wiley & sons Ltd (2004)
Lidy, T.: Evaluation of New Audio Features and Their Utilization in Novel Music Retrieval Applications, Thesis (December 2006)
Raś, Z.W., Zhang, X.: Multi-hierarchical Music Automatic Indexing and Retrieval System (2008)
Goto, M.: A real-time music-scene description system: Predominant-f0 estimation for detecting melody and bass lines in real-world audio signals. Speech Communication 43, 311–329 (2004)
Redner, R.A., Walker, H.F.: Mixture densities, maximum likelihood and the EM algorithm. SIAM Review 26, 195–239 (1984)
Chechik, G., Ie, E., Rehn, M.: Large-Scale Content-Based Audio Retrieval from Text Queries. In: MIR 2008, October 30–31, pp. 105–112 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Thiruvengatanadhan, R., Dhanalakshmi, P., Palanivel, S. (2015). GMM Based Indexing and Retrieval of Music Using MFCC and MPEG-7 Features. In: Satapathy, S., Govardhan, A., Raju, K., Mandal, J. (eds) Emerging ICT for Bridging the Future - Proceedings of the 49th Annual Convention of the Computer Society of India (CSI) Volume 1. Advances in Intelligent Systems and Computing, vol 337. Springer, Cham. https://doi.org/10.1007/978-3-319-13728-5_41
Download citation
DOI: https://doi.org/10.1007/978-3-319-13728-5_41
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13727-8
Online ISBN: 978-3-319-13728-5
eBook Packages: EngineeringEngineering (R0)