Abstract
Feature analysis of audio compression is necessary to achieve high accuracy in musical content recognition and content-based music information retrieval (MIR). Bit rate differences are expected to adversely affect musical content analysis and content-based MIR results because the frequency response might be changed by the encoding. In this paper, we specifically examine its effect on the chroma vector, which is a commonly used feature vector for music signal processing. We analyze sound qualities extracted from encoded music files with different bit rates and compare them with the chroma features of original songs obtained using datasets for chord recognition.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Hamawaki, S., Funasawa, S., Katto, J., Ishizaki, H., Hoashi, K., Takishima, Y.: Feature Analysis and Normalization Approach for Robust Content-Based Music Retrieval to Encoded Audio with Different Bit Rates. In: Huet, B., Smeaton, A.F., Mayer-Patel, K., Avrithis, Y. (eds.) MMM 2009. LNCS, vol. 5371, pp. 298–309. Springer, Heidelberg (2009)
Fujishima, T.: Realtime Chord Recognition of Musical Sound: a System using Common Lisp Music. In: Proceedings of the International Computer Music Association, pp. 464–467 (1999)
Harte, C., Sandler, M.: Automatic Chord Identification using a Quantised Chromagram. In: Proceedings of the Audio Engineering Society (2005)
Ellis, D., Poliner, G.: Identifying Cover Songs with Chroma Features and Dynamic Programming Beat Tracking. In: Proceedings of ICASSP, pp. 1429–1432 (2007)
Mauch, M., Dixon, S.: Approximate Note Transcription for the Improved Identification of Difficult Chords. In: Proceedings of the International Society for Music Information Retrieval Conference (2010)
Müller, M., Ewert, S.: Towards Timbre-invariant Audio Features for Harmony-based Music. IEEE Trans. on Audio, Speech, and Language Processing 18(3), 649–662 (2010)
Thiede, T., Treurniet, W.C., Bitto, R., Schmidmer, C., Sporer, T., Beerends, J.G., Colomes, C.: PEAQ-The ITU Standard for Objective Measurement of Perceived Audio Quality. Journal of Audio Engineering Society 48(1/2), 3–29 (2000)
LAME MP3 Encoder, http://lame.sourceforge.net
Nero AAC Codec, http://www.nero.com/enu/company/about-nero/nero-aac-codec.php
RAREWARES – oggenc2, http://www.rarewares.org/ogg-oggenc.php
Intelligent Sound Processing, http://kom.aau.dk/project/isound/
Kabal, P.: An Examination and Interpretation of ITU-R BS.1387: Perceptual Evaluation of Audio Quality. TSP Lab Technical Report, Dept. ECE, McGill University (2002)
Supervised Chord Recognition for Music Audio in Matlab, http://labrosa.ee.columbia.edu/projects/chords/
Joachims, T.: Sequence Tagging with Structural Support Vector Machines (2008), http://www.cs.cornell.edu/people/tj/svm_light/svm_hmm.html
isophonics, http://isophonics.net/
Goto, M.: AIST Annotation for the RWC Music Database. In: Proceedings of the International Conference on Music Information Retrieval, pp. 359–360 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Uemura, A., Ishikura, K., Katto, J. (2014). Effects of Audio Compression on Chord Recognition. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds) MultiMedia Modeling. MMM 2014. Lecture Notes in Computer Science, vol 8326. Springer, Cham. https://doi.org/10.1007/978-3-319-04117-9_34
Download citation
DOI: https://doi.org/10.1007/978-3-319-04117-9_34
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04116-2
Online ISBN: 978-3-319-04117-9
eBook Packages: Computer ScienceComputer Science (R0)