Skip to main content

GMM Based Indexing and Retrieval of Music Using MFCC and MPEG-7 Features

  • Conference paper
  • 2708 Accesses

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 337))

Abstract

Audio which includes voice, music, and various kinds of environmental sounds, is an important type of media, and also a significant part of video. The digital music databases in place these days, people begin to realize the importance of effectively managing music databases relying on music content analysis. The goal of music indexing and retrieval system is to provide the user with capabilities to index and retrieve the music data in an efficient manner. For efficient music retrieval, some sort of music similarity measure is desirable. In this paper, we propose a method for indexing and retrieval of the classified music using Mel-Frequency Cepstral Coefficients (MFCC) and MPEG-7 features. Music clip extraction, feature extraction, creation of an index and retrieval of the query clip are the major issues in automatic audio indexing and retrieval. Indexing is done for all the music audio clips using Gaussian mixture model (GMM) models, based on the features extracted. For retrieval, the probability that the query feature vector belongs to each of the Gaussian is computed. The average Probability density function is computed for each of the Gaussians and the retrieval is based on the highest probability.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Grosche, P.M.: Signal Processing Methods for Beat Tracking, Music Segmentation and Audio Retrieval. Thesis Universität des Saarlandes (2012)

    Google Scholar 

  2. Kumar, R.C.P., Chandy, D.A.: Audio retrieval using timbral feature. In: 2013 International Conference on Emerging Trends in Computing, Communication and Nanotechnology (ICE-CCN), March 25-26, pp. 222–226 (2013), doi:10.1109/ICE-CCN.2013.6528497

    Google Scholar 

  3. Nagavi, T.C., Anusha, S.B., Monisha, P., Poornima, S.P.: Content based audio retrieval with MFCC feature extraction, clustering and sort-merge techniques Computing. In: 2013 Fourth International Conference on Communications and Networking Technologies (ICCCNT), July 4-6, pp. 1–6 (2013)

    Google Scholar 

  4. Kinnunen, T., Saeidi, R., Sedlak, F., Lee, K.A., Sandberg, J., Hansson-Sandsten, M., Li, H.: Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification. IEEE Transactions on Audio, Speech, and Language Processing 20(7), 1990–2001 (2012)

    Article  Google Scholar 

  5. Krishna Kishore, K.V., Krishna Satish, P.: Emotion recognition in speech using MFCC and wavelet features. In: 2013 IEEE 3rd International Advance Computing Conference (IACC), February 22-23, pp. 842–847 (2013)

    Google Scholar 

  6. Zanoni, M., Ciminieri, D., Sarti, A., Tubaro, S.: Searching for dominant high-level features for Music Information Retrieval. In: 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO), August 27-31, pp. 2025–2029 (2012)

    Google Scholar 

  7. Tang, H., Chu, S.M., Hasegawa-Johnson, M., Huang, T.S.: Partially Supervised Speaker Clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 34(5), 959–971 (2012)

    Article  Google Scholar 

  8. Kim, H.-G., Moreau, N., Sikora, T.: MPEG-7 Audio and Beyond Audio Content Indexing andRetrieval. John wiley & sons Ltd (2004)

    Google Scholar 

  9. Lidy, T.: Evaluation of New Audio Features and Their Utilization in Novel Music Retrieval Applications, Thesis (December 2006)

    Google Scholar 

  10. Raś, Z.W., Zhang, X.: Multi-hierarchical Music Automatic Indexing and Retrieval System (2008)

    Google Scholar 

  11. Goto, M.: A real-time music-scene description system: Predominant-f0 estimation for detecting melody and bass lines in real-world audio signals. Speech Communication 43, 311–329 (2004)

    Article  Google Scholar 

  12. Redner, R.A., Walker, H.F.: Mixture densities, maximum likelihood and the EM algorithm. SIAM Review 26, 195–239 (1984)

    Article  MATH  MathSciNet  Google Scholar 

  13. Chechik, G., Ie, E., Rehn, M.: Large-Scale Content-Based Audio Retrieval from Text Queries. In: MIR 2008, October 30–31, pp. 105–112 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Thiruvengatanadhan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Thiruvengatanadhan, R., Dhanalakshmi, P., Palanivel, S. (2015). GMM Based Indexing and Retrieval of Music Using MFCC and MPEG-7 Features. In: Satapathy, S., Govardhan, A., Raju, K., Mandal, J. (eds) Emerging ICT for Bridging the Future - Proceedings of the 49th Annual Convention of the Computer Society of India (CSI) Volume 1. Advances in Intelligent Systems and Computing, vol 337. Springer, Cham. https://doi.org/10.1007/978-3-319-13728-5_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13728-5_41

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13727-8

  • Online ISBN: 978-3-319-13728-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics