Cross-Similarity Measurement of Music Sections: A Framework for Large-scale Cover Song Identification

  • Conference paper
Advances in Intelligent Information Hiding and Multimedia Signal Processing

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 63)

Abstract

For large-scale cover song identification, most previous work represents each song with a single feature vector. Although this approach ensures structure invariance, it may overcorrect, since it entirely neglects the structural features of the song. To address this problem, we put forward a novel framework for large-scale cover song identification based on music structure segmentation, aiming to match the relevant sections and ignore the irrelevant ones. In our implementation, we apply the average and weighted average methods to integrate the similarities of section pairs. We evaluate the proposed framework with three representative previous methods: 2D Fourier magnitude coefficients, chord profiles, and cognition-inspired descriptors. The experimental results show that all three methods perform significantly better within our framework than in their original works.
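
The integration of section-pair similarities described in the abstract can be illustrated with a minimal sketch. The abstract does not state the similarity measure, the rule for pairing sections, or how the weights are chosen, so everything specific here is an assumption made for illustration: cosine similarity between per-section feature vectors, a best-match rule (each query section keeps its most similar candidate section), and weights supplied by the caller (for example, section durations). The function names are hypothetical and do not come from the paper.

```python
import math
from typing import List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity of two vectors; returns 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def cross_similarity(query: List[Sequence[float]],
                     candidate: List[Sequence[float]]) -> List[List[float]]:
    """Similarity of every query section against every candidate section."""
    return [[cosine(q, c) for c in candidate] for q in query]


def best_matches(matrix: List[List[float]]) -> List[float]:
    """Assumed pairing rule: keep each query section's best candidate match."""
    return [max(row) for row in matrix]


def average_score(matrix: List[List[float]]) -> float:
    """Plain average of the per-section best-match similarities."""
    best = best_matches(matrix)
    return sum(best) / len(best)


def weighted_average_score(matrix: List[List[float]],
                           weights: Sequence[float]) -> float:
    """Weighted average; weights (e.g. section durations) are an assumption."""
    best = best_matches(matrix)
    return sum(w * s for w, s in zip(weights, best)) / sum(weights)


# Toy example: two sections per song, two-dimensional section features.
query_sections = [[1.0, 0.0], [0.0, 1.0]]
candidate_sections = [[1.0, 0.0], [1.0, 1.0]]
sim = cross_similarity(query_sections, candidate_sections)
avg = average_score(sim)
wavg = weighted_average_score(sim, weights=[3.0, 1.0])
```

In this toy case the first query section has an exact match in the candidate while the second has only a partial one, so giving the first section a larger weight raises the weighted score above the plain average. In a real system the section features would come from a structure segmentation step followed by one of the three descriptors named in the abstract.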

Author information

Corresponding author

Correspondence to Kang Cai.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Cai, K., Yang, D., Chen, X. (2017). Cross-Similarity Measurement of Music Sections: A Framework for Large-scale Cover Song Identification. In: Pan, JS., Tsai, PW., Huang, HC. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. Smart Innovation, Systems and Technologies, vol 63. Springer, Cham. https://doi.org/10.1007/978-3-319-50209-0_19

  • DOI: https://doi.org/10.1007/978-3-319-50209-0_19

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50208-3

  • Online ISBN: 978-3-319-50209-0

  • eBook Packages: Engineering, Engineering (R0)
