Skip to main content

The Acoustic Event Detector of AIT

  • Conference paper
Multimodal Technologies for Perception of Humans (RT 2007, CLEAR 2007)

Abstract

In this paper the acoustic event detection and classification system that has been developed at Athens Information Technology is presented. This system relies on the use of several Hidden Markov Models arranged in a hierarchical manner in order to provide more accurate detections. The audio streams are split into overlapping frames from which the necessary for training and testing features are obtained. A post processing scheme has also been developed in order to smooth the raw detections. The results that were obtained from the application of this system on the testing data of the CLEAR evaluation, obtained from five different sites are presented and the performance of this system is discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2), 257–286 (1989)

    Article  Google Scholar 

  2. Burges, C.J.C.: A Tutorial on Support Vector Machines for Pattern Recognition. Data Mining and Knowledge Discovery 2, 121–167 (1998)

    Article  Google Scholar 

  3. Temko, A., Malkin, R., Zieger, C., Macho, D., Nadeu, C., Omologo, M.: CLEAR Evaluation of Acoustic Event Detection and Classification Systems. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  4. Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. John Willey & Sons (2001)

    Google Scholar 

  5. Mauuary, L., Monné, J.: Speech/non-speech Detection for Voice Response Systems. In: Eurospeech 1993, Berlin, Germany, pp. 1097–1100 (1993)

    Google Scholar 

  6. Martin, A., Charlet, D., Mauuary, L.: Robust Speech/Non-Speech Detection Using LDA Applied to MFCC. In: ICASSP (2001)

    Google Scholar 

  7. Ramirez, J., Segura, J.C., Benitez, C., Garcia, L., Rubio, A.: Statistical Voice Activity Detection Using a Multiple Observation Likelihood ratio Test. IEEE Signal Processing Letters 12(10), 689–692 (2005)

    Article  Google Scholar 

  8. Temko, A.: AED evaluation plan. In: CLEAR (2007), www.clear-evaluation.org

Download references

Author information

Authors and Affiliations

Authors

Editor information

Rainer Stiefelhagen Rachel Bowers Jonathan Fiscus

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Boukis, C., Polymenakos, L.C. (2008). The Acoustic Event Detector of AIT. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68585-2_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68584-5

  • Online ISBN: 978-3-540-68585-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics