Abstract
A method is presented that detects unexpected acoustic events, i.e., occurrences of acoustic objects that do not belong to any of the learned classes but nevertheless appear to constitute meaningful acoustic events. Building on the framework of [Weinshall et al.], general and specific acoustic classifiers are implemented and combined to detect events to which they respond incongruously, indicating an unexpected event. Detected events are subsequently identified by estimating the source direction, for which a novel classification-based approach is outlined. Performance, evaluated as a function of signal-to-noise ratio (SNR) and type of unexpected event, is decent at SNRs better than 5 dB.
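The incongruence idea summarized in the abstract can be illustrated with a minimal sketch. It is not the chapter's implementation: it assumes that each classifier outputs a score in [0, 1], and the function name `is_incongruent` and the thresholds `accept` and `reject` are hypothetical choices made here for illustration only.

```python
from typing import Sequence


def is_incongruent(
    p_general: float,
    p_specific: Sequence[float],
    accept: float = 0.5,   # hypothetical threshold for the general classifier
    reject: float = 0.5,   # hypothetical threshold for the specific classifiers
) -> bool:
    """Flag an event that the general classifier accepts as a meaningful
    acoustic object while every specific (learned-class) classifier rejects it."""
    general_accepts = p_general >= accept
    # If no specific classifier exists, none of them can claim the event.
    all_specific_reject = max(p_specific, default=0.0) < reject
    return general_accepts and all_specific_reject


# General classifier is confident, no known class claims the event.
print(is_incongruent(0.9, [0.1, 0.2]))
# One specific classifier recognizes the event, so it is an expected one.
print(is_incongruent(0.9, [0.8, 0.1]))
```

Under these assumptions, an event is flagged only when the two levels disagree in one direction: the general level accepts and the specific level rejects. A low general score is treated as background or noise rather than as an unexpected event.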
References
Bach, J.H., Anemüller, J.: Detecting novel objects through classifier incongruence. In: Proc. Interspeech, Makuhari, Japan, pp. 2206–2209 (2010)
Bach, J.H., Kollmeier, B., Anemüller, J.: Modulation-based detection of speech in real background noise: Generalization to novel background classes. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, Texas, pp. 41–45 (2010)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), software available, http://www.csie.ntu.edu.tw/~cjlin/libsvm
Hermansky, H., Morgan, N.: Rasta processing of speech. IEEE Transactions on Speech and Audio Processing 2(4), 578–589 (1994)
Knapp, C., Carter, G.: The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustics, Speech and Signal Processing 24(4), 320–327 (1976)
Weinshall, D., Hermansky, H., Zweig, A., Luo, J., Jimison, H., Ohl, F., Pavel, M.: Beyond novelty detection: Incongruent events, when general and specific classifiers disagree. In: Koller, D., Schuurmans, D., Bengio, Y., Bottou, L. (eds.) Advances in Neural Information Processing Systems, vol. 21, pp. 1745–1752 (2009)
Tchorz, J., Kollmeier, B.: SNR estimation based on amplitude modulation analysis with applications to noise suppression. IEEE Transactions on Speech and Audio Processing 11(3), 184–192 (2003)
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Bach, JH., Kayser, H., Anemüller, J. (2012). Audio Classification and Localization for Incongruent Event Detection. In: Weinshall, D., Anemüller, J., van Gool, L. (eds) Detection and Identification of Rare Audiovisual Cues. Studies in Computational Intelligence, vol 384. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24034-8_2
DOI: https://doi.org/10.1007/978-3-642-24034-8_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24033-1
Online ISBN: 978-3-642-24034-8
eBook Packages: Engineering, Engineering (R0)