Audio Segmentation

Lu, Lie; Hanjalic, Alan

doi:10.1007/978-1-4614-8265-9_1033

Synonyms

Audio parsing; Auditory scene detection

Definition

Audio segmentation refers to the class of theories and algorithms designed to automatically reveal semantically meaningful temporal segments in an audio signal, also referred to as auditory scenes [7]. These scenes can be seen as equivalents of paragraphs in text, and can serve as input into audio categorization processes, either supervised (audio classification) or unsupervised (audio clustering). Through these processes, semantically similar auditory scenes can be grouped together and/or labeled using semantic indexes to provide multi-level, non-linear content-based access to large audio documents and collections.

Historical Background

Automatic detection of auditory scenes is an important step in enabling high-level semantic inference from general audio signals, and can benefit various content-based applications involving both audio and multimodal (multimedia) data sets. Traditional approaches to audio segmentation usually...

Author information

Authors and Affiliations

Microsoft Research Asia, Beijing, China
Lie Lu
Delft University of Technology, Delft, The Netherlands
Alan Hanjalic

Corresponding author

Correspondence to Lie Lu.

Editor information

Editors and Affiliations

Georgia Institute of Technology College of Computing, Atlanta, GA, USA
Ling Liu
University of Waterloo School of Computer Science, Waterloo, ON, Canada
M. Tamer Özsu

Section Editor information

Dept. of Computer Science, New Jersey Inst. of Technology, Newark, NJ, USA
Vincent Oria
Digital Content and Media Sciences Research Division, National Institute of Informatics, Tokyo, Japan
Shin'ichi Satoh

Cite this entry

Lu, L., Hanjalic, A. (2018). Audio Segmentation. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_1033

DOI: https://doi.org/10.1007/978-1-4614-8265-9_1033
Published: 07 December 2018
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer Science; Reference Module Computer Science and Engineering
