Abstract
In this paper the acoustic event detection and classification system that has been developed at Athens Information Technology is presented. This system relies on the use of several Hidden Markov Models arranged in a hierarchical manner in order to provide more accurate detections. The audio streams are split into overlapping frames from which the necessary for training and testing features are obtained. A post processing scheme has also been developed in order to smooth the raw detections. The results that were obtained from the application of this system on the testing data of the CLEAR evaluation, obtained from five different sites are presented and the performance of this system is discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2), 257–286 (1989)
Burges, C.J.C.: A Tutorial on Support Vector Machines for Pattern Recognition. Data Mining and Knowledge Discovery 2, 121–167 (1998)
Temko, A., Malkin, R., Zieger, C., Macho, D., Nadeu, C., Omologo, M.: CLEAR Evaluation of Acoustic Event Detection and Classification Systems. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, Springer, Heidelberg (2007)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. John Willey & Sons (2001)
Mauuary, L., Monné, J.: Speech/non-speech Detection for Voice Response Systems. In: Eurospeech 1993, Berlin, Germany, pp. 1097–1100 (1993)
Martin, A., Charlet, D., Mauuary, L.: Robust Speech/Non-Speech Detection Using LDA Applied to MFCC. In: ICASSP (2001)
Ramirez, J., Segura, J.C., Benitez, C., Garcia, L., Rubio, A.: Statistical Voice Activity Detection Using a Multiple Observation Likelihood ratio Test. IEEE Signal Processing Letters 12(10), 689–692 (2005)
Temko, A.: AED evaluation plan. In: CLEAR (2007), www.clear-evaluation.org
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Boukis, C., Polymenakos, L.C. (2008). The Acoustic Event Detector of AIT. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_31
Download citation
DOI: https://doi.org/10.1007/978-3-540-68585-2_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68584-5
Online ISBN: 978-3-540-68585-2
eBook Packages: Computer ScienceComputer Science (R0)