AllThatSounds: Associative Semantic Categorization of Audio Data

  • Conference paper
Auditory Display (CMMR 2009, ICAD 2009)

Abstract

Finding appropriate, high-quality audio files for a sound track is a serious hurdle for many media producers. Because most digital sound archives restrict the categorization of audio data to verbal taxonomies, retrieving suitable sounds often becomes a tedious and time-consuming part of their work. The research project AllThatSounds tries to improve the search procedure by supplying additional associative and semantic classifications of the audio files. This is achieved by annotating the files with suitable metadata according to a customized systematic categorization scheme. Additional data is collected by evaluating user profiles and by analyzing the sounds with signal processing methods. Using artificial intelligence techniques, similarity distances are calculated between all audio files in the database, which enables a different, highly efficient way of searching: browsing across similar sounds. The project’s result is a tool for structuring sound databases with an efficient search component, which is meant to guide users to suitable sounds for the sound tracks of their media productions.
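
The idea of browsing across similar sounds can be illustrated with a minimal sketch. It is not the method used in AllThatSounds, whose features and distance measure are not given in the abstract. It assumes each sound is already described by a small numeric feature vector and uses plain Euclidean distance. The file names, feature values, and the helpers `distance_table` and `most_similar` are hypothetical.

```python
import math

# Hypothetical feature vectors, one per sound, scaled to [0, 1].
# The three dimensions stand in for signal-derived descriptors
# (for example loudness, brightness, noisiness); all values are invented.
features = {
    "door_slam.wav": [0.90, 0.30, 0.70],
    "gate_clang.wav": [0.85, 0.40, 0.65],
    "bird_song.wav": [0.20, 0.90, 0.10],
    "wind_howl.wav": [0.40, 0.20, 0.90],
}


def distance_table(feats):
    """Return the Euclidean distance for every ordered pair of sounds."""
    names = sorted(feats)
    return {
        (a, b): math.dist(feats[a], feats[b])
        for a in names
        for b in names
    }


def most_similar(name, feats, k=2):
    """Return the k sounds closest to `name`, nearest first."""
    others = [n for n in feats if n != name]
    others.sort(key=lambda n: math.dist(feats[name], feats[n]))
    return others[:k]


if __name__ == "__main__":
    # Browsing step: start from one sound and list its nearest neighbours.
    print(most_similar("door_slam.wav", features, k=2))
```

With these invented values, browsing from `door_slam.wav` lists `gate_clang.wav` first and `wind_howl.wav` second. A real system would precompute the distance table once and would likely combine signal features with the annotated metadata and user-profile data the abstract mentions.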


Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Rubisch, J., Husinsky, M., Raffaseder, H. (2010). AllThatSounds: Associative Semantic Categorization of Audio Data. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K. (eds) Auditory Display. CMMR 2009, ICAD 2009. Lecture Notes in Computer Science, vol 5954. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12439-6_25

  • DOI: https://doi.org/10.1007/978-3-642-12439-6_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12438-9

  • Online ISBN: 978-3-642-12439-6

  • eBook Packages: Computer Science, Computer Science (R0)
