Incongruence Detection in Audio-Visual Processing

Part of the book series: Studies in Computational Intelligence (SCI, volume 384)

Abstract

The recently introduced theory of incongruence allows for detection of unexpected events in observations via disagreement of classifiers on specific and general levels of a classifier hierarchy which encodes the understanding a machine currently has of the world. We present an application of this theory, a hierarchy of classifiers describing an audio-visual speaker detector, and show successful incongruence detection on sequences acquired by a static as well as by a moving AWEAR 2.0 device using the presented classifier hierarchy.
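
The disagreement test described in the abstract can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the chapter: the `Observation` fields stand in for the outputs of a visual person detector, an audio detector, and a directly trained speaker classifier, and the general level is assumed to be the conjunction of the two single-modality detectors. The sketch further assumes that the disagreement of interest is the case where the general level accepts an observation while the specific level rejects it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Observation:
    # All three fields are hypothetical detector outputs, used for illustration only.
    person_visible: bool    # visual detector: a person is seen
    sound_detected: bool    # audio detector: a sound source is heard
    speaker_detected: bool  # specific classifier: a speaking person is recognised


def general_accepts(obs: Observation) -> bool:
    """General level (assumed): conjunction of the single-modality detectors."""
    return obs.person_visible and obs.sound_detected


def specific_accepts(obs: Observation) -> bool:
    """Specific level (assumed): the directly trained audio-visual classifier."""
    return obs.speaker_detected


def is_incongruent(obs: Observation) -> bool:
    """Flag an observation that the general level accepts but the specific level rejects."""
    return general_accepts(obs) and not specific_accepts(obs)


# A person is seen and a sound is heard, yet the specific classifier
# does not recognise a speaker: the two levels disagree.
unexpected = Observation(person_visible=True, sound_detected=True, speaker_detected=False)
# All levels agree that a speaker is present.
expected = Observation(person_visible=True, sound_detected=True, speaker_detected=True)
```

Under these assumptions, `is_incongruent(unexpected)` is true and `is_incongruent(expected)` is false; an observation that the general level already rejects is not flagged, since there is no disagreement to report.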

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Havlena, M., Heller, J., Kayser, H., Bach, JH., Anemüller, J., Pajdla, T. (2012). Incongruence Detection in Audio-Visual Processing. In: Weinshall, D., Anemüller, J., van Gool, L. (eds) Detection and Identification of Rare Audiovisual Cues. Studies in Computational Intelligence, vol 384. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24034-8_5

  • DOI: https://doi.org/10.1007/978-3-642-24034-8_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24033-1

  • Online ISBN: 978-3-642-24034-8

  • eBook Packages: Engineering, Engineering (R0)
