Insights of object proposal evaluation

Wang, Yuantian; Huang, Lei; Ren, Tongwei; Zhong, Sheng-Hua; Gu, Han; Liu, Yan

doi:10.1007/s11042-017-5471-6

Published: 04 December 2017

Volume 78, pages 13111–13130, (2019)

Multimedia Tools and Applications

Yuantian Wang¹,
Lei Huang¹,
Tongwei Ren¹ (ORCID: 0000-0003-3092-424X),
Sheng-Hua Zhong²,
Han Gu¹ &
Yan Liu³

Abstract

Object proposal aims to locate category-independent objects in a given image using a limited number of object candidates indicated by bounding boxes, and can serve as a foundation for various multimedia applications. Current evaluation criteria based on recall cannot reveal the real abilities of different object proposal methods in objectness measurement. In this paper, we propose a novel object proposal evaluation criterion to replace recall, named objectness measurement ability (OMA). We first analyze the probability of hitting an object by non-repetitive random sampling (HPRS) and provide an algorithm for calculating HPRS efficiently. Based on HPRS, we define OMA and extend three commonly used object proposal evaluation criteria by replacing recall with OMA. We evaluate six typical object proposal methods using recall-based and OMA-based criteria on the test data of PASCAL VOC 2007 and PASCAL VOC 2012. The experimental results show that OMA-based criteria provide more stable evaluation results than recall-based ones in revealing objectness measurement ability.
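
As a rough illustration of the quantities named in the abstract, the sketch below computes recall of a proposal set at an intersection-over-union (IoU) threshold, and the chance that at least one box drawn without replacement from a candidate pool hits an object. It is not the paper's HPRS algorithm or its OMA definition, neither of which is given in this text. The function names, the `(x1, y1, x2, y2)` box layout, the 0.5 IoU threshold, and the assumption that the number of hitting boxes in the pool is already known are all hypothetical choices made for this example.

```python
from math import comb


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def recall(gt_boxes, proposals, iou_thresh=0.5):
    """Fraction of ground-truth boxes matched by at least one proposal."""
    if not gt_boxes:
        return 0.0
    hits = sum(
        1 for g in gt_boxes
        if any(iou(g, p) >= iou_thresh for p in proposals)
    )
    return hits / len(gt_boxes)


def hit_probability(total, hitting, draws):
    """Probability that at least one of `draws` boxes, sampled without
    replacement from `total` candidates, is among the `hitting` ones.

    This is the standard hypergeometric complement, used here only as a
    stand-in for the idea of hitting an object by random sampling.
    """
    if not 0 <= hitting <= total or not 0 <= draws <= total:
        raise ValueError("need 0 <= hitting <= total and 0 <= draws <= total")
    # comb() returns 0 when draws exceeds the number of non-hitting boxes,
    # so the probability correctly becomes 1 in that case.
    return 1.0 - comb(total - hitting, draws) / comb(total, draws)
```

Comparing a method's recall with the hit probability that random sampling would reach for the same number of candidates gives a feel for why raw recall alone can overstate how well a method measures objectness.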

Acknowledgements

This work is supported by the National Science Foundation of China (61321491, 61202320), the Undergraduate Innovation Project of Nanjing University (X201610284039), and the Collaborative Innovation Center of Novel Software Technology and Industrialization.

Author information

Authors and Affiliations

State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
Yuantian Wang, Lei Huang, Tongwei Ren & Han Gu
College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
Sheng-Hua Zhong
Computing Department, The Hong Kong Polytechnic University, Hong Kong, China
Yan Liu

Corresponding author

Correspondence to Tongwei Ren.

About this article

Cite this article

Wang, Y., Huang, L., Ren, T. et al. Insights of object proposal evaluation. Multimed Tools Appl 78, 13111–13130 (2019). https://doi.org/10.1007/s11042-017-5471-6

Received: 01 August 2017
Revised: 11 October 2017
Accepted: 27 November 2017
Published: 04 December 2017
Issue Date: 30 May 2019
DOI: https://doi.org/10.1007/s11042-017-5471-6
