Effects of Audio Compression on Chord Recognition

Uemura, Aiko; Ishikura, Kazumasa; Katto, Jiro

doi:10.1007/978-3-319-04117-9_34

Effects of Audio Compression on Chord Recognition

Aiko Uemura²²,
Kazumasa Ishikura²² &
Jiro Katto²²

Conference paper

1983 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8326))

Abstract

Feature analysis of audio compression is necessary to achieve high accuracy in musical content recognition and content-based music information retrieval (MIR). Bit rate differences are expected to adversely affect musical content analysis and content-based MIR results because the frequency response might be changed by the encoding. In this paper, we specifically examine its effect on the chroma vector, which is a commonly used feature vector for music signal processing. We analyze sound qualities extracted from encoded music files with different bit rates and compare them with the chroma features of original songs obtained using datasets for chord recognition.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hamawaki, S., Funasawa, S., Katto, J., Ishizaki, H., Hoashi, K., Takishima, Y.: Feature Analysis and Normalization Approach for Robust Content-Based Music Retrieval to Encoded Audio with Different Bit Rates. In: Huet, B., Smeaton, A.F., Mayer-Patel, K., Avrithis, Y. (eds.) MMM 2009. LNCS, vol. 5371, pp. 298–309. Springer, Heidelberg (2009)
Chapter Google Scholar
Fujishima, T.: Realtime Chord Recognition of Musical Sound: a System using Common Lisp Music. In: Proceedings of the International Computer Music Association, pp. 464–467 (1999)
Google Scholar
Harte, C., Sandler, M.: Automatic Chord Identification using a Quantised Chromagram. In: Proceedings of the Audio Engineering Society (2005)
Google Scholar
Ellis, D., Poliner, G.: Identifying Cover Songs with Chroma Features and Dynamic Programming Beat Tracking. In: Proceedings of ICASSP, pp. 1429–1432 (2007)
Google Scholar
Mauch, M., Dixon, S.: Approximate Note Transcription for the Improved Identification of Difficult Chords. In: Proceedings of the International Society for Music Information Retrieval Conference (2010)
Google Scholar
Müller, M., Ewert, S.: Towards Timbre-invariant Audio Features for Harmony-based Music. IEEE Trans. on Audio, Speech, and Language Processing 18(3), 649–662 (2010)
Article Google Scholar
Thiede, T., Treurniet, W.C., Bitto, R., Schmidmer, C., Sporer, T., Beerends, J.G., Colomes, C.: PEAQ-The ITU Standard for Objective Measurement of Perceived Audio Quality. Journal of Audio Engineering Society 48(1/2), 3–29 (2000)
Google Scholar
LAME MP3 Encoder, http://lame.sourceforge.net
Nero AAC Codec, http://www.nero.com/enu/company/about-nero/nero-aac-codec.php
RAREWARES – oggenc2, http://www.rarewares.org/ogg-oggenc.php
Intelligent Sound Processing, http://kom.aau.dk/project/isound/
Kabal, P.: An Examination and Interpretation of ITU-R BS.1387: Perceptual Evaluation of Audio Quality. TSP Lab Technical Report, Dept. ECE, McGill University (2002)
Google Scholar
Supervised Chord Recognition for Music Audio in Matlab, http://labrosa.ee.columbia.edu/projects/chords/
Joachims, T.: Sequence Tagging with Structural Support Vector Machines (2008), http://www.cs.cornell.edu/people/tj/svm_light/svm_hmm.html
isophonics, http://isophonics.net/
Goto, M.: AIST Annotation for the RWC Music Database. In: Proceedings of the International Conference on Music Information Retrieval, pp. 359–360 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo, 169-8555, Japan
Aiko Uemura, Kazumasa Ishikura & Jiro Katto

Authors

Aiko Uemura
View author publications
You can also search for this author in PubMed Google Scholar
Kazumasa Ishikura
View author publications
You can also search for this author in PubMed Google Scholar
Jiro Katto
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, Dublin City University, 9, Dublin, Ireland
Cathal Gurrin
Fakultät IV für Elektrotechnik und Informatik, Technische Universität Berlin / DAI-Labor, 10587, Berlin, Germany
Frank Hopfgartner
Department of Information and Computing Sciences, Universiteit Utrecht, 3584, Utrecht, CC, The Netherlands
Wolfgang Hurst
UiT The Arctic University of Norway, 9019, Tromsø, Norway
Håvard Johansen
Singapore University of Technology and Design, Singapore
Hyowon Lee
School of Electrical Engineering, Dublin City University, Ireland
Noel O’Connor

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Uemura, A., Ishikura, K., Katto, J. (2014). Effects of Audio Compression on Chord Recognition. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds) MultiMedia Modeling. MMM 2014. Lecture Notes in Computer Science, vol 8326. Springer, Cham. https://doi.org/10.1007/978-3-319-04117-9_34

Download citation

DOI: https://doi.org/10.1007/978-3-319-04117-9_34
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04116-2
Online ISBN: 978-3-319-04117-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics