Contextual Object Detection with a Few Relevant Neighbors

Barnea, Ehud; Ben-Shahar, Ohad

doi:10.1007/978-3-030-20890-5_31

Contextual Object Detection with a Few Relevant Neighbors

Ehud Barnea¹⁸ &
Ohad Ben-Shahar¹⁸

Conference paper
First Online: 02 June 2019

2059 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11362))

Abstract

A natural way to improve the detection of objects is to consider the contextual constraints imposed by the detection of additional objects in a given scene. In this work, we exploit the spatial relations between objects in order to improve detection capacity, as well as analyze various properties of the contextual object detection problem. To precisely calculate context-based probabilities of objects, we developed a model that examines the interactions between objects in an exact probabilistic setting, in contrast to previous methods that typically utilize approximations based on pairwise interactions. Such a scheme is facilitated by the realistic assumption that the existence of an object in any given location is influenced by only few informative locations in space. Based on this assumption, we suggest a method for identifying these relevant locations and integrating them into a mostly exact calculation of probability based on their raw detector responses. This scheme is shown to improve detection results and provides unique insights about the process of contextual inference for object detection. We show that it is generally difficult to learn that a particular object reduces the probability of another, and that in cases when the context and detector strongly disagree this learning becomes virtually impossible for the purposes of improving the results of an object detector. Finally, we demonstrate improved detection results through use of our approach as applied to the PASCAL VOC and COCO datasets.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Arbel, N., Avraham, T., Lindenbaum, M.: Inner-scene similarities as a contextual cue for object detection (2017). arXiv preprint
Google Scholar
Bell, S., Lawrence Zitnick, C., Bala, K., Girshick, R.: Inside-outside net: detecting objects in context with skip pooling and recurrent neural networks. In: CVPR, pp. 2874–2883 (2016)
Google Scholar
Chechetka, A., Guestrin, C.: Evidence-specific structures for rich tractable CRFs. In: Advances in Neural Information Processing Systems, pp. 352–360 (2010)
Google Scholar
Cinbis, R.G., Sclaroff, S.: Contextual object detection using set-based classification. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 43–57. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33783-3_4
Chapter Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, pp. 886–893 (2005)
Google Scholar
Desai, C., Ramanan, D., Fowlkes, C.C.: Discriminative models for multi-class object layout. Int. J. Comput. Vis. 95(1), 1–12 (2011)
Article MathSciNet Google Scholar
Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: CVPR, pp. 1–8 (2008)
Google Scholar
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Article Google Scholar
Galleguillos, C., Rabinovich, A., Belongie, S.: Object categorization using co-occurrence, location and appearance. In: CVPR, pp. 1–8 (2008)
Google Scholar
Girshick, R.: Fast R-CNN. In: ICCV, pp. 1440–1448 (2015)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR (2014)
Google Scholar
Heitz, G., Koller, D.: Learning spatial context: using stuff to find things. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 30–43. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-88682-2_4
Chapter Google Scholar
Hoiem, D., Efros, A.A., Hebert, M.: Putting objects in perspective. Int. J. Comput. Vis. 80(1), 3–15 (2008)
Article Google Scholar
Kohli, P., Rother, C.: Higher-order models in computer vision. In: Image Processing and Analysing with Graphs: Theory and Practice, Chap. 3, pp. 65–92. CRC Press (2012)
Google Scholar
Koller, D., Friedman, N.: Probabilistic Graphical Models: Principles and Techniques. MIT Press, Cambridge (2009)
MATH Google Scholar
Komodakis, N., Tziritas, G.: Image completion using efficient belief propagation via priority scheduling and dynamic pruning. IEEE Trans. Pattern Anal. Mach. Intell. 16(11), 2649–2661 (2007)
MathSciNet Google Scholar
Li, J., Wei, Y., Liang, X., Dong, J., Xu, T., Feng, J., Yan, S.: Attentive contexts for object detection. IEEE Trans. Multimed. 19(5), 944–954 (2017)
Article Google Scholar
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Chapter Google Scholar
Luo, W., Li, Y., Urtasun, R., Zemel, R.: Understanding the effective receptive field in deep convolutional neural networks. In: NIPS, pp. 4898–4906 (2016)
Google Scholar
Mairon, R., Ben-Shahar, O.: A closer look at context: from coxels to the contextual emergence of object saliency. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 708–724. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_46
Chapter Google Scholar
Mottaghi, R., et al.: The role of context for object detection and semantic segmentation in the wild. In: CVPR, pp. 891–898 (2014)
Google Scholar
Oramas, J., Tuytelaars, T.: Recovering hard-to-find object instances by sampling context-based object proposals. CVIU 152, 118–130 (2016)
Google Scholar
José Oramas, M., De Raedt, L., Tuytelaars, T.: Allocentric pose estimation. In: ICCV (2013)
Google Scholar
José Oramas, M., De Raedt, L., Tuytelaars, T.: Towards cautious collective inference for object verification. In: WACV (2014)
Google Scholar
Pearl, J.: Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, San Francisco (1988)
MATH Google Scholar
Perko, R., Leonardis, A.: A framework for visual-context-aware object detection in still images. CVIU 114(6), 700–711 (2010)
Google Scholar
Rabinovich, A., Vedaldi, A., Galleguillos, C., Wiewiora, E., Belongie, S.: Objects in context. In: ICCV, pp. 1–8. IEEE (2007)
Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Fast R-CNN: towards real-time object detection with region proposal networks. In: NIPS, pp. 91–99 (2015)
Google Scholar
Shalev-Shwartz, S., Ben-David, S.: Understanding Machine Learning: From Theory to Algorithms. Cambridge University Press, Cambridge (2014)
Book Google Scholar
Torralba, A., Murphy, K.P., Freeman, W.T.: Contextual models for object detection using boosted random fields. In: NIPS, pp. 1401–1408 (2004)
Google Scholar
Torralba, A., Sinha, P.: Statistical context priming for object detection. In: ICCV, vol. 1, pp. 763–770. IEEE (2001)
Google Scholar
Wolf, L., Bileschi, S.: A critical view of context. Int. J. Comput. Vis. 69(2), 251–261 (2006)
Article Google Scholar
Yu, R., Chen, X., Morariu, V.I., Davis, L.S.: The role of context selection in object detection. British Machine Vision Conference abs/1609.02948 (2016)
Google Scholar

Download references

Acknowledgments

This research was supported in part by Israel Ministry of Science, Technology and Space (MOST Grant 54178). We also thank the Frankel Fund and the Helmsley Charitable Trust through the ABC Robotics Initiative, both at Ben-Gurion University of the Negev, for their generous support.

Author information

Authors and Affiliations

Department of Computer Science, Ben-Gurion University, Beer-Sheva, Israel
Ehud Barnea & Ohad Ben-Shahar

Authors

Ehud Barnea
View author publications
You can also search for this author in PubMed Google Scholar
Ohad Ben-Shahar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ehud Barnea .

Editor information

Editors and Affiliations

IIIT Hyderabad, Hyderabad, India
C. V. Jawahar
ANU, Canberra, ACT, Australia
Hongdong Li
Simon Fraser University, Burnaby, BC, Canada
Greg Mori
ETH Zurich, Zurich, Zürich, Switzerland
Konrad Schindler

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Barnea, E., Ben-Shahar, O. (2019). Contextual Object Detection with a Few Relevant Neighbors. In: Jawahar, C., Li, H., Mori, G., Schindler, K. (eds) Computer Vision – ACCV 2018. ACCV 2018. Lecture Notes in Computer Science(), vol 11362. Springer, Cham. https://doi.org/10.1007/978-3-030-20890-5_31

Download citation

DOI: https://doi.org/10.1007/978-3-030-20890-5_31
Published: 02 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-20889-9
Online ISBN: 978-3-030-20890-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics