Abstract
We present a patch-based algorithm for the purpose of object classification in video surveillance. Within detected regions-of-interest (ROIs) of moving objects in the scene, a feature vector is calculated based on template matching of a large set of image patches. Instead of matching direct image pixels, we use Gabor-filtered versions of the input image at several scales. This approach has been adopted from recent experiments in generic object-recognition tasks. We present results for a new typical video surveillance dataset containing over 9,000 object images. Furthermore, we compare our system performance with another existing smaller surveillance dataset. We have found that with 50 training samples or higher, our detection rate is on the average above 95%. Because of the inherent scalability of the algorithm, an embedded system implementation is well within reach.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Muller-Schneiders, S., Jager, T., Loos, H., Niem, W.: Performance evaluation of a real time video surveillance system. In: Proc. of 2nd Joint IEEE Int. Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS), pp. 137–144. IEEE Computer Society Press, Los Alamitos (2005)
Kollnig, H., Nagel, H.: 3d pose estimation by directly matching polyhedral models to gray value gradients. Int. Journal of Computer Vision (IJCV) 23(3), 283–302 (1997)
Lou, J., Tan, T., Hu, W., Yang, H., Maybank, S.: 3-d model-based vehicle tracking. IEEE Transactions on Image Processing 14(10), 1561–1569 (2005)
Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M., Poggio, T.: Robust object recognition with cortex-like mechanisms. Transactions on Pattern Analysis and Machine Intelligence (PAMI) 29(3), 411–426 (2007)
Bose, B., Grimson, W.E.L.: Improving object classification in far-field video. In: CVPR. Proc. of IEEE Computer Vision and Pattern Recognition, Washington, DC, USA, vol. 2, pp. 181–188. IEEE Computer Society Press, Los Alamitos (2004)
Haritaoglu, I., Harwood, D., Davis, L.: W4: real-time surveillance of people and their activities. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 22, 809–830 (2000)
Wijnhoven, R., de With, P.: 3d wire-frame object-modeling experiments for video surveillance. In: Proc. of 27th Symposium on Information Theory in the Benelux, pp. 101–108 (2006)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: CVPR. Proc. of the 2001 IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, vol. 1, pp. 511–518. IEEE, Los Alamitos (2001)
Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 23(4), 349–361 (2001)
Oren, M., Papageorgiou, C., Sinha, P., Osuna, E., Poggio, T.: Pedestrian detection using wavelet templates. In: CVPR. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, San Juan, Puerto Rico, pp. 193–199. IEEE Computer Society Press, Los Alamitos (1997)
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 27(10), 1615–1630 (2005)
Dalai, N., Triggs, B.: Histogram of oriented gradients for human detection. In: CVPR. Proc. of the IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, pp. 886–893. IEEE Computer Society Press, Los Alamitos (2005)
Ke, Y., Sukthankar, R.: Pca-sift: A more distinctive representation for local image descriptors. In: CVPR. Proc. of IEEE Computer Vision and Pattern Recognition, vol. 2, pp. 506–513. IEEE, Los Alamitos (2004)
Mikolajczyk, K., Schmid, C., Zisserman, A.: Human detection based on a probabilistic assembly of robust part detectors. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 69–81. Springer, Heidelberg (2004)
Ma, X., Grimson, W.: Edge-based rich representation for vehicle classification. In: ICCV. Proc. of IEEE Int. Conf. on Computer Vision, vol. 2, pp. 1185–1192. IEEE Computer Society Press, Los Alamitos (2005)
Serre, T.: Learning a Dictionary of Shape-Components in Visual Cortex: Comparison with Neurons, Humans and Machines. PhD thesis, Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory (April 2006)
Ullman, S., Vidal-Naquet, M., Sali, E.: Visual features of intermediate complexity and their use in classification. Nature Neuroscience 5, 682–687 (2002)
Riesenhuber, M., Poggio, T.: Models of object recognition. Nature Neuroscience 3, 1199–1204 (2000)
Serre, T., Wolf, L., Poggio, T.: Object recognition with features inspired by visual cortex. In: Proc. of Computer Vision and Pattern Recognition (CVPR), pp. 994–1000 (2005)
Mutch, J., Lowe, D.: Multiclass object recognition with sparse, localized features. In: CVPR. IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, vol. 1, pp. 11–18. IEEE Computer Society Press, Los Alamitos (2006)
Collobert, R., Bengio, S., Mariethoz, J.: Torch: a modular machine learning software library. Technical report, Dalle Molle Institute for Perceptual Artificial Intelligence, PO Box 592, Martigny, Valais, Switzerland (October 2002)
Ponce, J., Berg, T., Everingham, M., Forsyth, D., Hebert, M., Lazebnik, S., Marszalek, M., Schmid, C., Russell, B., Torralba, A., Williams, C., Zhang, J., Zisserman, A.: Dataset issues in object recognition. In: Ponce, J., Hebert, M., Schmid, C., Zisserman, A. (eds.) Toward Category-Level Object Recognition. LNCS, vol. 4170, Springer, Heidelberg (2006)
Viola, P., Jones, M., Snow, D.: Detecting pedestrians using patterns of motion and appearance. In: ICCV. Proc. of the Ninth IEEE Int. Conf. on Computer Vision, vol. 2, pp. 734–741. IEEE Computer Society Press, Los Alamitos (2003)
Wu, B., Nevatia, R.: Detection of multiple, partially occluded humans in a single image by bayesian combination of edgelet part detectors. In: ICCV. Proc. of the 10th IEEE Int. Conf. on Computer Vision, vol. 1, pp. 90–97. IEEE Computer Society, Washington, DC, USA (2005)
Zuo, F.: Embedded face recognition using cascaded structures. PhD thesis, Technische Universiteit Eindhoven, The Netherlands (October 2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wijnhoven, R., de With, P.H.N. (2007). Patch-Based Experiments with Object Classification in Video Surveillance. In: Blanc-Talon, J., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2007. Lecture Notes in Computer Science, vol 4678. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74607-2_26
Download citation
DOI: https://doi.org/10.1007/978-3-540-74607-2_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74606-5
Online ISBN: 978-3-540-74607-2
eBook Packages: Computer ScienceComputer Science (R0)