Skip to main content
Log in

Insights of object proposal evaluation

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Object proposal aims to locate category-independent objects in a given image with a limited number of object candidates indicated by bounding boxes, which can be served as a fundamental of various multimedia applications. Current evaluation criteria based on recall cannot reveal the real abilities of different object proposal methods in objectness measurement. In this paper, we propose a novel object proposal evaluation criterion instead of recall, named objectness measurement ability (OMA). We first analyze the probability to hit an object by non-repetitive random sampling (HPRS), and provide an algorithm for calculating HPRS efficiently. Based on HPRS, we define OMA and extend three commonly used object proposal evaluation criteria by replacing recall with OMA. We evaluated six typical object proposal methods using recall based criteria and OMA based criteria on the test data of PASCAL VOC 2007 and PASCAL VOC 2012. The experimental results show that OMA based criteria can provide more stable evaluation results than recall based ones in revealing objectness measurement ability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Alexe B, Deselaers T, Ferrari V (2012) Measuring the objectness of image windows. TPAMI 34(11): 2189–2202

    Article  Google Scholar 

  2. Arbelaez P, Pont-Tuset J, Barron J, Marques F, Malik J (2014) Multiscale combinatorial grouping. In: CVPR, pp 328–335

  3. Bai J, Chen Z, Feng B, Xu B (2014) Chinese image text recognition on grayscale pixels. In: ICASSP, pp 1380–1384

  4. Bao BK, Zhu G, Shen J, Yan S (2013) Robust image analysis with sparse representation on quantized visual features. TIP 22(3):860–871

    MathSciNet  MATH  Google Scholar 

  5. Carreira J, Sminchisescu C (2012) Cpmc: automatic object segmentation using constrained parametric min-cuts. TPAMI 34(7):1312–1328

    Article  Google Scholar 

  6. Chavali N, Agrawal H, Mahendru A, Batra D (2015) Object-proposal evaluation protocol is ‘gameable’. Comp Sci

  7. Chen X, Ma H, Wang X, Zhao Z (2015) Improving object proposals with multi-thresholding straddling expansion. In: CVPR, pp 2587–2595

  8. Chen Z, Sun L, Yang S (2009) Auto-cut for web images. In: MM, pp 529–532

  9. Chen Z, Cao J, Song Y, Zhang Y, Li J (2010) Web video categorization based on wikipedia categories and content-duplicated open resources. In: MM, pp 1107–1110

  10. Cheng MM, Zhang Z, Lin WY, Torr P (2014) Bing: binarized normed gradients for objectness estimation at 300fps. In: CVPR, pp 3286–3293

  11. Everingham M, Gool LV, Williams CKI, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. IJCV 88(2):303–338

    Article  Google Scholar 

  12. Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A The PASCAL visual object classes challenge 2012 (VOC2012) results. http://www.pascal-network.org/challenges/VOC/voc2012/workshop/index.html

  13. Gao Z, Zhang H, Xu G, Xue Y (2015) Multi-perspective and multi-modality joint representation and recognition model for 3d action recognition. NEUCOM 151:554–564

    Google Scholar 

  14. Gao Z, Zhang Y, Zhang H, Xue YB, Xu GP (2016) Multi-dimensional human action recognition model based on image set and group sparisty. NEUCOM 215:138–149

    Google Scholar 

  15. Guo J, Ren T, Huang L, Bei J (2017) Saliency detection on sampled images for tag ranking. MMSJ. https://doi.org/10.1007/s00530-017-0546-9

  16. Hosang J, Benenson R, Dollar P, Schiele B (2015) What makes for effective detection proposals? TPAMI 38(4):6644–6665

    Google Scholar 

  17. Jiang F, Hu HM, Zheng J, Li B (2016) A hierarchal bow for image retrieval by enhancing feature salience. NEUCOM 175(PA):146–154

    Google Scholar 

  18. Krähenbühl P, Koltun V (2014) Geodesic object proposals. In: ECCV, pp 725–739

  19. Liu AA, Su YT, Nie WZ, Kankanhalli M (2017) Hierarchical clustering multi-task learning for joint human action grouping and recognition. TPAMI 39 (1):102–114

    Article  Google Scholar 

  20. Liu Y, Liu J, Li Z, Tang J, Lu H (2013) Weakly-supervised dual clustering for image semantic segmentation. In: CVPR, pp 2075–2082

  21. Liu J, Li Z, Tang J, Jiang Y, Lu H (2014) Personalized geo-specific tag recommendation for photos on social websites. TMM 16(3):588–600

    Google Scholar 

  22. Liu J, Ren T, Bao BK, Bei J (2016) Depth-aware layered edge for object proposal. In: ICME. IEEE, pp 1–6

  23. Liu J, Ren T, Wang Y, Zhong SH, Bei J, Chen S (2017) Object proposal on rgb-d images via elastic edge boxes. NEUCOM 236:134–146

    Google Scholar 

  24. Manen S, Guillaumin M, Gool LV (2013) Prime object proposals with randomized prim’s algorithm. In: ICCV, pp 2536–2543

  25. Rahman ASMM, Saddik AE (2011) Mobile based multimodal retrieval and navigation of learning objects using a 3d car metaphor. In: ICIMCS, pp 103–107

  26. Ren T, Qiu Z, Liu Y, Yu T, Bei J (2015) Soft-assigned bag of features for object tracking. MMSJ 21(2):189–205

    Google Scholar 

  27. Ren T, Liu Y, Ju R, Wu G (2016) How important is location information in saliency detection of natural images. MTAP 75(5):2543–2564

    Google Scholar 

  28. Sang J, Mei T, Xu YQ, Zhao C, Xu C, Li S (2013) Interaction design for mobile visual search. TMM 15(7):1665–1676

    Google Scholar 

  29. Sang J, Xu C (2012) Robust face-name graph matching for movie character identification. TMM 14(3):586–596

    Google Scholar 

  30. Sang J, Xu C, Liu J (2012) User-aware image tag refinement via ternary semantic analysis. TMM 14(3):883–895

    Google Scholar 

  31. Sang J, Xu C, Lu D (2012) Learn to personalized image search from the photo sharing websites. TMM 14(4):963–974

    Google Scholar 

  32. Song X, Zhang J, Han Y, Jiang J (2016) Semi-supervised feature selection via hierarchical regression for web image classification. MMSJ 22(1):41–49

    Google Scholar 

  33. Tang J, Li H, Qi GJ, Chua TS (2010) Image annotation by graph-based inference with integrated multiple/single instance representations. TMM 12(2):131–141

    Google Scholar 

  34. Uijlings JRR, Sande KEAVD, Gevers T, Smeulders AWM (2013) Selective search for object recognition. IJCV 104(2):154–171

    Article  Google Scholar 

  35. Wang P, Sun L, Yang S, Smeaton AF (2016) Towards training-free refinement for semantic indexing of visual media. In: MMM

  36. Wang S, Huang Q, Jiang S, Tian Q (2010) Nearest-neighbor classification using unlabeled data for real world image application. In: MM, pp 1151–1154

  37. Zhang K, Liu Q, Song H, Li X (2014) A variational approach to simultaneous image segmentation and bias correction. T Cybernetics 45(8):1426–1437

    Article  Google Scholar 

  38. Zhu S, Aloufi S, El-Saddik A (2015) Utilizing image social clues for automated image tagging. In: ICME, pp 1–6

  39. Zhu Y, Huang X, Huang Q, Tian Q (2016) Large-scale video copy retrieval with temporal-concentration sift. NEUCOM 187(C):83–91

    Google Scholar 

  40. Zitnick CL, Dollár P (2014) Edge boxes: Locating object proposals from edges. Springer International Publishing

Download references

Acknowledgements

This work is supported by National Science Foundation of China (61321491, 61202320), Undergraduate Innovation Project of Nanjing University (X201610284039), and Collaborative Innovation Center of Novel Software Technology and Industrialization.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tongwei Ren.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Huang, L., Ren, T. et al. Insights of object proposal evaluation. Multimed Tools Appl 78, 13111–13130 (2019). https://doi.org/10.1007/s11042-017-5471-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-017-5471-6

Keywords

Navigation