An Integrated Method for Multiple Object Detection and Localization

Das, Dipankar; Mansur, Al; Kobayashi, Yoshinori; Kuno, Yoshinori

doi:10.1007/978-3-540-89646-3_14

Dipankar Das²⁸,
Al Mansur²⁸,
Yoshinori Kobayashi²⁸ &
…
Yoshinori Kuno²⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5359))

Included in the following conference series:

International Symposium on Visual Computing

1540 Accesses
4 Citations

Abstract

The objective of this paper is to use computer vision to detect and localize multiple object within an image in the presence of a cluttered background, substantial occlusion and significant scale changes. Our approach consists of first generating a set of hypotheses for each object using a generative model (pLSA) with a bag of visual words representing each image. Then, the discriminative part verifies each hypothesis using a multi-class SVM classifier with merging features that combines both spatial shape and color appearance of an object. In the post-processing stage, environmental context information is used to improve the performance of the system. A combination of features and context information are used to investigate the performance on our local database. The best performance is obtained using object-specific weighted merging features and the context information. Our approach overcomes the limitations of some state of the art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Seemann, E., Leibe, B., Mikolajczyk, K., Schiele, B.: An evaluation of local shape-based features for pedestrain detection. In: Proc. of British Machine Vision Conference (BMVC 2005), Oxford, UK (2005)
Google Scholar
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Proc. of European Conference on Computer Vision (ECCV 2004), Workshop on Statistical Learning in Computer Vision, Prague (2004)
Google Scholar
Diplaros, A., Gevers, T., Patras, I.: Combining color and shape information for illumination-viewpoint invariant object recognition. IEEE Transactions on Image Processing 15, 1–11 (2006)
Article Google Scholar
Stella, X., Ralph, G., Jianbo, S.: Concurrent object recognition and segmentation by graph partioning. In: Proc. of Neural Information Processing Systems (NIPS), Vancouver, Canada, pp. 1383–1390 (2002)
Google Scholar
Leibe, B., Leonardis, A., Schiele, B.: Combined object categorization and segmentation with an implicit shape model. In: Proc. of ECCV 2004, Workshop on Statistical Learning in Computer Vision, Prague, pp. 17–32 (2004)
Google Scholar
Guillaume, B., Bill, T.: Hierarchical part-based visual object categorization. In: Proc. of International Conference on Computer Vision and Pattern Recognition (CVPR(1)), San Diego, CA, USA, pp. 710–715 (2005)
Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Weakly supervised scale-invariant learning of model for visual recognition. International Journal of Computer Vision (IJCV) 71, 273–303 (2007)
Article Google Scholar
Ferrari, V., Tinne, T., Luc, V.G.: Object detection by contour segmentation networks. In: Proc. of ECCV(3), Graz, Austria, pp. 14–28 (2006)
Google Scholar
Jacobs, D.: Robust and efficient detection of salient convex groups. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 18, 23–37 (1996)
Article Google Scholar
Marcin, M., Cordelia, S.: Spatial weigthing for bag-of-features. In: Proc. of CVPR (2), New York, NY, pp. 2118–2125 (2006)
Google Scholar
Bosch, A., Zisserman, A., Muñoz, X.: Representing shape with spatial pyramid kernel. In: ACM International Conference on Image and Video Retrieval (CIVR), Amsterdam, The Netherlands, pp. 401–408 (2007)
Google Scholar
Ferrari, V., Fevrier, L., Jurie, F., Schmid, C.: Group of adjacent contour segment for object detection. PAMI 30, 30–51 (2008)
Article Google Scholar
Josef, S., Bryan, R.C., Alexei, A., Zisserman, A., William, T.: Discovering objects and their location in images. In: Proc. of the IEEE International Conference on Computer Vision (ICCV), Beijing, China, pp. 370–377 (2005)
Google Scholar
Stefan, Z., Manuela, M.: Detection and localization of multiple objects. In: Proc. of Humanoids, Genoa, Italy (2006)
Google Scholar
Erik, M.C., Jochen, T.: Shared Features for Scalable Appearance-Based Object Recognition. In: Proc. of IEEE Workshop on Application of Computer Vision (WACV), Breckenridge, Colorado, pp. 16–21 (2005)
Google Scholar
Bosch, A., Zisserman, A., Muñoz, X.: Scene classification using a hybrid generative/discriminative approach. PAMI 30, 712–727 (2008)
Article Google Scholar
Fritz, M., Leibe, B., Caputo, B., Schiele, B.: Integrating representative and discriminative models for object category detection. In: Proc. of ICCV, Beijing, China, pp. 1363–1370 (2005)
Google Scholar
Hofmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Machine Learning 42, 177–196 (2001)
Article MATH Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. IJCV 60, 91–110 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Science and Engineering, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Saitama, 338-8570, Japan
Dipankar Das, Al Mansur, Yoshinori Kobayashi & Yoshinori Kuno

Authors

Dipankar Das
View author publications
You can also search for this author in PubMed Google Scholar
Al Mansur
View author publications
You can also search for this author in PubMed Google Scholar
Yoshinori Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Yoshinori Kuno
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, University of Nevada, Reno, USA
George Bebis
NASA Ames Research Center, Moffett Field, CA, USA
Richard Boyle
Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Bahram Parvin
Desert Research Institute, Reno, NV, USA
Darko Koracin
Digital Image Research Center, Kingston University, London, UK
Paolo Remagnino
Mitsubishi Electric Research Laboratories, P.O. Box 02139, Cambridge, MA, USA
Fatih Porikli
Computer and Information Science and Engineering, University of Florida, P.O. Box, FL 32611-6120, Gainsville, USA
Jörg Peters
IBM T.J. Watson Research Center, 19 Skyline Drive, NY 10532, Hawthorne, USA
James Klosowski
128 Memorial Mall, Stewart B001, IN 47907, West Lafayette, USA
Laura Arns
Denver Museum of Nature and Space, 2001 Colorade Blvd. Denver,, CO 80205, USA
Yu Ka Chun
Departmen of Computer Science,, NC State University, Campus Box 8206, NC 27695-8206, Raleigh, USA
Theresa-Marie Rhyne
Los Alamos National Labs, P.O. Box 1663, NM 87545, Los Alamos, USA
Laura Monroe

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Das, D., Mansur, A., Kobayashi, Y., Kuno, Y. (2008). An Integrated Method for Multiple Object Detection and Localization. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2008. Lecture Notes in Computer Science, vol 5359. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89646-3_14

Download citation

DOI: https://doi.org/10.1007/978-3-540-89646-3_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89645-6
Online ISBN: 978-3-540-89646-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics