Music Genre Classification Using an Auditory Memory Model

Jensen, Kristoffer

doi:10.1007/978-3-642-31980-8_7

Kristoffer Jensen¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7172))

Included in the following conference series:

925 Accesses
2 Citations

Abstract

Audio feature estimation is potentially improved by including the auditory short-term memory (STM) model. A new paradigm of audio feature estimation is obtained by adding the influence of notes in the STM. These notes are identified using the directional spectral flux, and the spectral content that is increased by the new note is added to the STM. The STM is exponentially fading with time span and number of elements, and each note only belongs to the STM for a limited time. Initial investigations regarding the behavior of the STM shows promising results, and an initial experiment with sensory dissonance has been undertaken with good results. The parameters obtained from the auditory memory model, along with the dissonance measure, are shown here to be of interest in music genre classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 72.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Atkinson, R.C., Shiffrin, R.M.: Human memory: A proposed system and its control processes. In: Spence, K.W., Spence, J.T. (eds.) The Psychology of Learning and Motivation, vol. 2, pp. 89–195. Academic Press, New York (1968)
Google Scholar
Pashler, H., Carrier, M.: Structures, Processes, and the Flow of Information. In: Bjork, Bjork (eds.) Memory: Handbook of Perception and Cognition, pp. 3–29. Academic Press (1996)
Google Scholar
Snyder, B.: Music and Memory. An Introduction. The MIT Press, Cambridge (2000)
Google Scholar
Baddeley, A.D., Hitch, G.: Working memory. In: Bower, G.H. (ed.) The Psychology of Learning and Motivation: Advances in Research and Theory, vol. 8, pp. 47–89. Academic Press, New York (1974)
Google Scholar
Baddeley, A.D.: The episodic buffer: a new component of working memory? Trends in Cognitive Science 4, 417–423 (2000)
Article Google Scholar
Miller, G.A.: The magical number seven plus or minus two: some limits on our capacity for processing information. Psychological Review 63(2), 81–97 (1956)
Article Google Scholar
Gross, R.: Psychology: The Science of Mind and Behaviour. Hodder Arnold Publication (2005)
Google Scholar
Massaro, D., Loftus, G.R.: Sensory and Perceptual Storage. In: Bjork, E.L., Bjork, R.A. (eds.) Memory, pp. 86–99. Academic Press, San Diego (1996)
Google Scholar
Foote, J.: A similarity measure for automatic audio classification. In: Proceedings AAAI 1997 Spring Symposium on Intelligent Integration and Use of Text, Image, Video, and Audio Corpora, Stanford, Palo Alto, California, USA (1997)
Google Scholar
McNab, R.J., Smith, L.A., Witten, I.H., Henderson, C.L., Cunningham, S.J.: Towards the digital music library: Tune retrieval from acoustic input. In: Proceedings DL 1996, pp. 11–18 (1996)
Google Scholar
Rolland, P.Y., Raskinis, G., Ganascia, J.G.: Musical content-based retrieval: an overview of the Melodiscov approach and system. ACM Multimedia 1, 81–84 (1999)
Google Scholar
Ghias, A., Logan, J., Chamberlin, D., Smith, B.C.: Query by humming - musical information retrieval in an audio database. In: Proceedings Multimedia, pp. 231–236 (2001)
Google Scholar
Pauws, S., Eggen, B.: PATS: Realization and user evaluation of an automatic playlist generator. In: Proceedings of the 3rd ISMIR, Ircam, France, pp. 222–230 (2002)
Google Scholar
Anderson, J.R., Lebiere, C.: Atomic components of thought, Hillsdale, NJ (1998)
Google Scholar
Moore, B.C.J.: Psychology of Hearing. Academy Press (1997)
Google Scholar
Jensen, K.: Multiple scale music segmentation using rhythm, timbre and harmony. EURASIP Journal on Applied Signal Processing, Special issue on Music Information Retrieval Based on Signal Processing (2007)
Google Scholar
Plomp, R., Levelt, W.J.M.: Tonal Consonance and Critical Bandwidth. J. Acoust. Soc. Am. 38(4), 548–560 (1965)
Article Google Scholar
Sethares, W.: Local consonance and the relationship between timbre and scale. J. Acoust. Soc. Am. 94(3), 1218–1228 (1993)
Article MathSciNet Google Scholar
Meng, A.: Temporal feature integration for music organization. Ph.D. dissertation, IMM, Denmark Technical University (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

ad:mt., Aalborg University Esbjerg, Niels Bohr Vej 8, 6700, Esbjerg, Denmark
Kristoffer Jensen

Authors

Kristoffer Jensen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CNRS - LMA, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Sølvi Ystad , Mitsuko Aramaki & Richard Kronland-Martinet , &
Aalborg University Esbjerg, Niels Bohr Vej 8, 6700, Esbjerg, Denmark
Kristoffer Jensen
North Orissa University, Sriram Chandra Vihar, Takatpur, 757003, Baripada, Orissa, India
Sanghamitra Mohanty

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jensen, K. (2012). Music Genre Classification Using an Auditory Memory Model. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K., Mohanty, S. (eds) Speech, Sound and Music Processing: Embracing Research in India. CMMR FRSM 2011 2011. Lecture Notes in Computer Science, vol 7172. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31980-8_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-31980-8_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31979-2
Online ISBN: 978-3-642-31980-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics