Abstract
Most previous methods for generic object recognition explicitly or implicitly assume that an image contains objects from a single category, although objects from multiple categories often appear together in an image. In this paper, we present a novel method for object recognition that explicitly deals with objects of multiple categories coexisting in an image. Furthermore, our proposed method aims to recognize objects by taking advantage of a scene’s context represented by the co-occurrence relationship between object categories. Specifically, our method estimates the mixture ratios of multiple categories in an image via MAP regression, where the likelihood is computed based on the linear combination model of frequency distributions of local features, and the prior probability is computed from the co-occurrence relation. We conducted a number of experiments using the PASCAL dataset, and obtained the results that lend support to the effectiveness of the proposed method.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Biederman, I., Mezzanotte, R., Rabinowitz, J.: Scene perception: detecting and judging objects undergoing relational violations. Cognitive Psychology 14(2), 143–177 (1982)
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Proc. ECCV 2004 Workshop on Statistical Learning in Computer Vision, pp. 1–22 (2004)
Everingham, M., Zisserman, A., Williams, C., Van Gool, L.: The 2006 PASCAL Visual Object Classes Challenge (VOC 2006) Results, http://www.pascal-network.org/challenges/VOC/voc2006/results.pdf
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: Proc. IEEE CS Conf. Computer Vision and Pattern Recognition (CVPR 2003), pp. II-264–II-271 (2003)
Frome, A., Singer, Y., Sha, F., Malik, J.: Learning globally-consistent local distance functions for shape-based image retrieval and classification. In: Proc. IEEE Int’l Conf. Computer Vision (ICCV 2007), pp. 1–8 (2007)
Galleguillos, C., Rabinovich, A., Belongie, S.: Object categorization using co-occurrence, location and appearance. In: Proc. IEEE CS Conf. Computer Vision and Pattern Recognition (CVPR 2008), pp. 1–8 (2008)
Hoiem, D., Efros, A., Hebert, M.: Putting objects in perspective. In: Proc. IEEE CS Conf. Computer Vision and Pattern Recognition (CVPR 2006), pp. 2137–2144 (2006)
Lowe, D.: Distinctive image features from scale-invariant keypoints. Int’l Journal of Compute Vision 60(2), 91–110 (2004)
Qi, G.-J., Hua, X.-S., Rui, Y., Mei, T., Tang, J., Zhang, H.-J.: Concurrent multiple instance learning for image categorization. In: Proc. IEEE CS Conf. Computer Vision and Pattern Recognition (CVPR 2007), pp. 1–8 (2007)
Rabinovich, A., Vedaldi, A., Galleguillos, C., Wiewiora, E., Belongie, S.: Objects in context. In: Proc. IEEE Int’l Conf. Computer Vision (ICCV 2007), pp. 1–8 (2007)
Savarese, S., Fei-Fei, L.: 3D generic object categorization, localization and pose estimation. In: Proc. IEEE Int’l Conf. Computer Vision (ICCV 2007), pp. 1–8 (2007)
Sivic, J., Russell, B., Efros, A., Zisserman, A., Freeman, W.: Discovering objects and their location in images. In: Proc. IEEE Int’l Conf. Computer Vision (ICCV 2005), pp. 370–377 (2005)
Yang, L., Jin, R., Sukthankar, R., Jurie, F.: Unifying discriminative visual codebook generation with classifier training for object category recognition. In: Proc. IEEE CS Conf. Computer Vision and Pattern Recognition (CVPR 2008), pp. 1–8 (2008)
Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: a comprehensive study. Int’l Journal of Computer Vision 73(2), 213–238 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Okabe, T., Kondo, Y., Kitani, K.M., Sato, Y. (2009). Recognizing Multiple Objects via Regression Incorporating the Co-occurrence of Categories. In: Wada, T., Huang, F., Lin, S. (eds) Advances in Image and Video Technology. PSIVT 2009. Lecture Notes in Computer Science, vol 5414. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92957-4_43
Download citation
DOI: https://doi.org/10.1007/978-3-540-92957-4_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92956-7
Online ISBN: 978-3-540-92957-4
eBook Packages: Computer ScienceComputer Science (R0)