Abstract
Finding appropriate and high-quality audio files for the creation of a sound track nowadays presents a serious hurdle to many media producers. As most digital sound archives restrict the categorization of audio data to verbal taxonomies, this process of retrieving suitable sounds often becomes a tedious and time-consuming part of their work. The research project AllThatSounds tries to enhance the search procedure by supplying additional, associative and semantic classifications of the audio files. This is achieved by annotating these files with suitable metadata according to a customized systematic categorization scheme. Moreover, additional data is collected by the evaluation of user profiles and by analyzing the sounds with signal processing methods. Using artificial intelligence techniques, similarity distances are calculated between all the audio files in the database, so as to devise a different, highly efficient search algorithm by browsing across similar sounds. The project’s result is a tool for structuring sound databases with an efficient search component, which means to guide users to suitable sounds for their sound track of media productions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cano, P., Koppenberger, M., Le Groux, S., Ricard, J., Herrera, P., Wack, N.: Nearest-neighbor generic sound classification with a wordnet-based taxonomy. In: Proceedings of AES 116th Convention, Berlin, Germany (2004)
Chion, M.: Audio-Vision - Sound on Screen. Columbia University Press (1994)
Feng, D., Siu, W.C., Zhang, H.J.: Multimedia Information Retrieval and Management. Springer, Berlin (2003)
Mandel, M.M., Ellis, D.P.W.: Song-Level Features and Support Vector Machines for Music Classification. In: Proceedings of the 6th International Conference on Music Information Retrieval, London, pp. 594–599 (2005)
Raffaseder, H.: Audiodesign. Hanser-Fachbuchverlag. Hanser-Fachbuchverlag, Leipzig (2002)
Schafer, M.R.: The Soundscape - Our Sonic Environment and the Tuning of the World. Destiny Books, Rochester (1994)
Sonnenschein, D.: Sound Design - The Expressive Power of Music, Voice and Sound Effects in Cinema. Michael Wiese Productions, Studio City (2001)
Truax, B.: Acoustic Communication, 2nd edn. Ablex Publishing, Westport (2001)
van Leeuwen, T.: Speech, Music, Sound. MacMillan Press Ltd., London (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rubisch, J., Husinsky, M., Raffaseder, H. (2010). AllThatSounds: Associative Semantic Categorization of Audio Data. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K. (eds) Auditory Display. CMMR ICAD 2009 2009. Lecture Notes in Computer Science, vol 5954. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12439-6_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-12439-6_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12438-9
Online ISBN: 978-3-642-12439-6
eBook Packages: Computer ScienceComputer Science (R0)