Task-Specific Salience for Object Recognition

Revaud, Jerome; Lavoue, Guillaume; Ariki, Yasuo; Baskurt, Atilla

doi:10.1007/978-3-642-17934-1_4

Task-Specific Salience for Object Recognition

Jerome Revaud⁴,
Guillaume Lavoue⁴,
Yasuo Ariki⁵ &
…
Atilla Baskurt⁴

Chapter

757 Accesses

Part of the book series: Studies in Computational Intelligence ((SCI,volume 339))

Abstract

Object recognition is a complex and challenging problem. It involves examining many different hypothesis in terms of the object class, position, scale, pose, etc., but the main trend in computer vision systems is to lazily rely on the brute force capacity of computers, that is to explore every possibilities indifferently. Sadly, in many case this scheme is way too slow for real-time or even practical applications. By incorporating salience in the recognition process, several approaches have shown that it is possible to get several orders of speed-up. In this chapter, we demonstrate the link between salience and cascaded processes and show why and how those ones should be constructed. We illustrate the benefits that it provides, in terms of detection speed, accuracy and robustness, and how it eases the combination of heterogeneous feature types (i.e. dense and sparse features) by some innovating strategies from the state-of-the-art and a practical application.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Arbelaez, P., Maire, M., Fowlkes, C., Malik, J.: From contours to regions: An empirical evaluation. In: Computer Vision and Pattern Recognition (2009)
Google Scholar
Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 8(6), 679–698 (1986)
Article Google Scholar
Chum, O., Zisserman, A.: An exemplar model for learning object classes. In: Computer Vision and Pattern Recognition (2007)
Google Scholar
Chum, O., Matas, J., Kittler, J.: Locally optimized ransac. Pattern Recognition, 236–243 (2003)
Google Scholar
Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV International Workshop on Statistical Learning in Computer Vision (2004)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Schmid, C., Soatto, S., Tomasi, C. (eds.) International Conference on Computer Vision & Pattern Recognition, vol. 2, pp. 886–893 (2005); INRIA Rhône-Alpes,ZIRST-655, av. de l’Europe, Montbonnot-38334
Google Scholar
Elad, M., Hel-Or, Y., Keshet, R.: Pattern detection using a maximal rejection classifier. Pattern Recognition Letters 23, 1459–1471 (2001)
Article Google Scholar
Epshtein, B., Ullman, S.: Semantic hierarchies for recognizing objects and parts. In: Computer Vision and Pattern Recognition, CVPR 2007 (2007)
Google Scholar
Eveland, C.K., Socolinsky, D.A., Priebe, C.E., Marchette, D.J.: A hierarchical methodology for class detection problems with skewed priors. J. Classif. 22(1), 17–48 (2005)
Article MATH MathSciNet Google Scholar
Everingham, M., Van Gool, L., Williams, C. K. I., Winn, J., and Zisserman, A.: The PASCAL Visual Object Classes Challenge 2007 (VOC 2007) Results, http://www.pascalnetwork.org/challenges/VOC/voc2007/workshop
Fei-Fei, L., Fergus, R., Perona, P.: One-shot learning of object categories. IEEE Trans. Pattern Anal. Mach. Intell. 28(4), 594 (2006)
Article Google Scholar
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.: Cascade object detection with deformable part models. In: ComputerVision and Pattern Recognition (2010)
Google Scholar
Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Article MathSciNet Google Scholar
Fleuret, F., Geman, D.: Coarse-to-fine face detection. Int. J. Comput. Vision 41(1-2), 85–107 (2001)
Article MATH Google Scholar
Fleuret, F., Geman, D.: Stationary features and cat detection. Journal of Machine Learning Research 9, 2549–2578 (2008)
MathSciNet Google Scholar
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. In: Proceedings of the Second European Conference on Computational Learning Theory, pp. 23–37. Springer, London (1995)
Google Scholar
Gangaputra, S., Geman, D.: A design principle for coarse-to-fine classification. In: CVPR 2006: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1877–1884. IEEE ComputerSociety, Washington, DC (2006)
Google Scholar
Gehler, P., Nowozin, S.: On feature combination for multiclass object classification. In: Proceedings of the Twelfth IEEE International Conference on Computer Vision, ICCV (2009)
Google Scholar
Gu, C., Lim, J.J., Arbelaez, P., Malik, J.: Recognition using regions. In: Computer Vision and Pattern Recognition (2009)
Google Scholar
Harris, C., Stephens, M.: A combined corner and edge detection. In: Proceedings of The Fourth Alvey Vision Conference, pp. 147–151 (1988)
Google Scholar
Harzallah, H., Jurie, F., Schmid, C.: Combining efficient object localization and image classification. In: International Conference on Computer Vision (2009)
Google Scholar
Huang, C., Ai, H., Li, Y., Lao, S.: Vector boosting for rotation invariant multi-view face detection. In: International Conference on Computer Vision (ICCV 2005), pp. 446–453. IEEE Computer Society, Los Alamitos (2005)
Google Scholar
Huang, C., Ai, H., Wu, B., Lao, S.: Boosting nested cascade detector for multi-view face detection. In: ICPR 2004: Proceedings of the Pattern Recognition, 17th International Conference on (ICPR 2004), vol. 2, pp. 415–418. IEEE Computer Society, Washington, DC (2004)
Chapter Google Scholar
Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans.Pattern Anal. Mach. Intell. 20(11), 1254–1259 (1998)
Article Google Scholar
Kruizinga, P., Petkov, N.: Nonlinear operator for oriented texture. IEEE Transactions on Image Processing 8(10), 1395–1407 (1999)
Article MathSciNet Google Scholar
Lampert, C.: An efficient divide-and-conquer cascade for nonlinear object detection. In: Computer Vision and Pattern Recognition (2010)
Google Scholar
Liu, X., Zhang, L., Li, M., Zhang, H., Wang, D.: Boosting image classification with lda-based feature combination for digital photograph management. Pattern Recognition 38(6), 887–901 (2005)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. IJCV 60(2), 91–110 (2004)
Article Google Scholar
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: British Machine Vision Conference, vol. 1, pp. 384–393 (2002)
Google Scholar
Messmer, B.T., Bunke, H.: A new algorithm for errortolerant subgraph isomorphism detection. IEEE Trans. Pattern Anal. Mach. Intell. 20(5), 493–504 (1998)
Article Google Scholar
Moreels, P., Perona, P.: Probabilistic coarse-to-fine object recognition. Technical report, California Institute of Technology (2005)
Google Scholar
Moreels, P., Perona, P.: Evaluation of features detectors and descriptors based on 3d objects. International Journal of Computer Vision 73(3), 263–284 (2007)
Article Google Scholar
Paletta, L., Fritz, G.: Reinforcement learning for decision making in sequential visual attention, pp. 293-306 (2008)
Google Scholar
Paletta, L., Tsotsos, J.K.: Attention in Cognitive Systems. LNCS. Springer, Heidelberg (2008)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: Computer Vision and Pattern Recognition, pp. 1–8 (2007)
Google Scholar
Rothganger, F., Lazebnik, S., Schmid, C., Ponce, J.: 3d object modeling and recognition using local affneinvariant image descriptors and multi-view spatial constraints. International Journal of Computer Vision 66(3), 231–259 (2006)
Article Google Scholar
Tola, E., Lepetit, V., Fua, P.: A fast local descriptor for dense matching. In: Computer Vision and Pattern Recognition (CVPR), pp. 1–8 (2008)
Google Scholar
Torralba, A., Murphy, K.P., Freeman, W.T.: Sharing visual features for multiclass and multiview object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(5), 854–869 (2007)
Article Google Scholar
Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: International Conference on ComputerVision (2009)
Google Scholar
Vidal-Naquet, M., Ullman, S.: Object recognition with informative features and linear classification. In: ICCV 2003: Proceedings of the Ninth IEEE International Conference on Computer Vision, p. 281. IEEE Computer Society, Washington, DC (2003)
Chapter Google Scholar
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Computer Vision and Pattern Recognition, pp. 511–518 (2001)
Google Scholar
Viola, P., Jones, M.J.: Robust real-time face detection. International Journal of Computer Vision 57(2), 137–154 (2004)
Article Google Scholar
Walther, D., Koch, C.: Modeling attention to salient proto-objects. Neural Networks 19(9), 1395–1407 (2006); Brain and Attention, Brain and Attention
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

INSA-Lyon, LIRIS, UMR5205, Universite de Lyon, CNRS, F-69621, France
Jerome Revaud, Guillaume Lavoue & Atilla Baskurt
CS17 Media Laboratory, Kobe University, Japan
Yasuo Ariki

Authors

Jerome Revaud
View author publications
You can also search for this author in PubMed Google Scholar
Guillaume Lavoue
View author publications
You can also search for this author in PubMed Google Scholar
Yasuo Ariki
View author publications
You can also search for this author in PubMed Google Scholar
Atilla Baskurt
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Deputy Director for Scientific Researches Institute of Informatics, Wroclaw University of Technology , Wyb. Wyspianskiego 27, 51-370, Wroclaw, Poland
Halina Kwaśnicka
Founding Director of the KES Centre School of Electrical and Information Engineering, Knowledge-Based Engineering University of South Australia Adelaide , Mawson Lakes Campus, 5095, South Australia, SA, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Revaud, J., Lavoue, G., Ariki, Y., Baskurt, A. (2011). Task-Specific Salience for Object Recognition. In: Kwaśnicka, H., Jain, L.C. (eds) Innovations in Intelligent Image Analysis. Studies in Computational Intelligence, vol 339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17934-1_4

Download citation

DOI: https://doi.org/10.1007/978-3-642-17934-1_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17933-4
Online ISBN: 978-3-642-17934-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics