Abstract
In this work we present a system for detection of objects from video streams based on properties of human vision such as saccadic eye movements and selective attention. An object, in this application a car, is represented as a collection of features (horizontal and vertical edges) arranged at specific spatial locations with respect to the position of the fixation point (the central edge). The collection of conditional probabilities, that estimate the locations of the car edges given the location of the central edge, are mapped into the weights of the neural network that combines information coming from the edge detectors (bottom-up) with expectations for edge locations (top-down). During the recognition process, the system efficiently searches the space of possible segmentations by investigating the local regions of the image in a way similar to human eye movements, probing and analyzing different locations of the input at different times. In contrast to motion-based models for vehicle detection [7, 8], our approach does not rely on motion information, and the system can detect both still and moving cars in real-time. However, adding motion information should improve the accuracy.
The author performed the work while at Brown University.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
N. Logothetis and N. Sheinberg. Visual object recognition, Annual Review of Neuroscience, 19: 577–621, 1996.
E. Wachsmut, M. Oram and D. Perrett. Recognition of objects and their component parts: responses of single units in the temporal cortex of the macaque, Cerebral Cortex, 4: 509–522 1994.
I. Biederman. Recognition by components: a theory of human image understanding, Psychological Review, 94: 115–147, 1987.
D. H. Hubel and T. N. Wiesel. Receptive fields and functional architecture of monkey striate cortex, J. Physiol., 195: 215–244, 1968.
P. Neskovic and L. Cooper. Neural network-based context driven recognition of on-line cursive script. In 7th International Workshop on Frontiers in Handwriting Recognition, pp. 352–362, 2000.
P. Neskovic, P. Davis, and L. Cooper Interactive parts model: an application to recognition of on-line cursive script. In Advances in Neural Information Processing Systems, 2000.
D. Koller, J. Weber, J. Malik, Robust multiple car tracking with occlusion reasoning, Proceedings of the 5th European Conference on Computer Vision, Springer-Verlag, Berlin, pp. 189–196, 1994.
A. Lipton, H. Fujiyoshi, F. Patil, Moving target classification and tracking from real-time video, IEEE Workshop on Applications of Computer Vision (WACV), Princeton NJ, pp. 8–14, 1998.
D. Koller, K. Danilidis, H. Nagel, Model-Based Object Tracking in Monocular Image Sequences of Road Traffic Scenes, International Journal of Computer Vision, 10–3, pp. 257–281, 1993.
M. Betke, E. Haritaoglu and L. Davis, Highway Scene Analysis. IEEE Conference on Intelligent Transportation Systems, Boston, 1997.
M. Betke, E. Haritaoglu and L. Davis, Real-time multiple vehicle detection and tracking from a moving vehicle, Machine Vision and Applications, 12: 69–83, 2000.
C. Papageorgiou and T. Poggio, A Trainable Object Detection System: Car Detection in Static Images, A.I. Memo No. 1673, MIT, 1999.
Z. Sun, G. Bebis and R. Miller, Quantized wavelet features and support vector machines for on-road vehicle detection, IEEE International Conference on Control, Automation, Robotics and Vision, 2002.
C. Goerick, N. Detlev and M. Werner. Artificial neural networks in real-time car detection and tracking applications, Pattern Recognition Letters, 17: 335–343, 1996.
H. Schneiderman and T. Kanade. A statistical method for 3D object detection applied to faces and cars, IEEE Conference on Computer Vision and Pattern Recognition, 2000.
P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features, IEEE Conference on Computer Vision and Pattern Recognition, 2001.
M. Weber, M. Welling and P. Perona. Towards Automatic Discovery of Object Categories, Computer Vision and Pattern Recognition, 2000.
S. Agarwal and D. Roth. Learning a sparse representation for object detection, 7th European Conference on Computer Vision, 4: 113–130, 2002.
S-W. Lee, H. H. Bulthoff and T. Poggio (Eds.), Biologically motivated computer vision, Berlin: Springer-Verlag, 2000.
J. Keller, S. Rogers, M Kabrisky and M. Oxley, Object Recognition Based on Human Saccadic Behavior, Pattern Analysis and Applications, Vol. 2, Springer-Verlag, London, pp. 251–263, 1999.
L. Wiskott, J. M. Fellous, N. Kruger, and C. von der Malsburg. Face recognition by elastic bunch graph matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19 (7): 775–779, 1997.
Rafael Gonzales and Richard Woods, Digital Image Processing,Addison-Wesley Publishing Company, 1993.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Neskovic, P., Schuster, D., Cooper, L.N. (2004). Biologically inspired recognition system for car detection from real-time video streams. In: Rajapakse, J.C., Wang, L. (eds) Neural Information Processing: Research and Development. Studies in Fuzziness and Soft Computing, vol 152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39935-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-39935-3_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-53564-2
Online ISBN: 978-3-540-39935-3
eBook Packages: Springer Book Archive