A Method for Music Structure Analysis Using MPEG-4 TwinVQ Audio Compression

  • Michihiro Kobayakawa
  • Mamoru Hoshi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6297)

Abstract

This paper proposes a method for analyzing the structure of a piece of music using the autocorrelation coefficients computed in the encoding step of TwinVQ audio compression. We phrase (segment) the autocorrelation sequence into subsequences using an extracted musical unit, classify the subsequences, and assign a label to each class. From the resulting sequence of labels, we extract the longest-matching subsequences to analyze the structure of the piece. To evaluate the method, we compare the subsequences it extracts with those extracted by hand. The experimental results indicate that the method performs well at analyzing the structure of a piece of music.
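
The three steps named in the abstract (phrasing, labeling, longest-match extraction) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the fixed-length phrasing, the greedy Euclidean-distance classifier, its `threshold` parameter, and the reading of "longest match" as the longest non-overlapping repeated run of labels are all assumptions made for illustration. The input is any per-frame feature sequence standing in for the TwinVQ autocorrelation coefficients.

```python
from typing import List, Sequence, Tuple


def phrase(features: Sequence[Sequence[float]], unit: int) -> List[List[float]]:
    """Cut a per-frame feature sequence into chunks of `unit` frames.

    Each chunk is flattened into one vector. Trailing frames that do not
    fill a whole unit are dropped (an assumption of this sketch).
    """
    chunks = []
    for start in range(0, len(features) - unit + 1, unit):
        chunks.append([x for frame in features[start:start + unit] for x in frame])
    return chunks


def label_chunks(chunks: List[List[float]], threshold: float) -> List[int]:
    """Greedy classification (hypothetical stand-in for the paper's classifier).

    A chunk joins the first class whose representative lies within
    `threshold` (Euclidean distance); otherwise it starts a new class.
    """
    representatives: List[List[float]] = []
    labels: List[int] = []
    for chunk in chunks:
        for k, rep in enumerate(representatives):
            dist = sum((a - b) ** 2 for a, b in zip(chunk, rep)) ** 0.5
            if dist <= threshold:
                labels.append(k)
                break
        else:
            representatives.append(chunk)
            labels.append(len(representatives) - 1)
    return labels


def longest_repeat(labels: Sequence[int]) -> Tuple[int, int, int]:
    """Return (length, first_start, second_start) of the longest run of
    labels that occurs twice without overlapping; (0, -1, -1) if none."""
    n = len(labels)
    best = (0, -1, -1)
    prev = [0] * (n + 1)  # prev[j]: match length ending at positions i-1 and j
    for i in range(1, n + 1):
        cur = [0] * (n + 1)
        for j in range(i + 1, n + 1):
            if labels[i - 1] == labels[j - 1]:
                # Cap at j - i so the two occurrences never overlap.
                length = min(prev[j - 1] + 1, j - i)
                cur[j] = length
                if length > best[0]:
                    best = (length, i - length, j - length)
        prev = cur
    return best


# Toy input: ten one-dimensional frames, phrased into units of two frames.
frames = [[0.0], [0.1], [1.0], [1.1], [0.0], [0.1], [1.0], [1.1], [5.0], [5.0]]
labels = label_chunks(phrase(frames, unit=2), threshold=0.5)
repeat = longest_repeat(labels)
```

On this toy input the first four units alternate between two classes, so the label sequence contains a repeated two-unit section. A real system would replace the fixed `unit` with the extracted musical unit and the greedy classifier with whatever classification the paper actually uses.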

Keywords

Frame Length; Accumulation Ratio; Label Sequence; Characteristic Section; Music Structure
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Michihiro Kobayakawa (1, 2)
  • Mamoru Hoshi (2)

  1. Tokyo-Metropolitan College of Industrial Technology
  2. The University of Electro-Communications
