How to Select and Customize Object Recognition Approaches for an Application?

Sorschag, Robert

doi:10.1007/978-3-642-27355-1_42

Robert Sorschag²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7131))

Included in the following conference series:

International Conference on Multimedia Modeling

2004 Accesses

Abstract

Recently, object recognition has been successfully implemented in a couple of multimedia content annotation and retrieval applications. The employed recognition approaches are carefully selected and adapted to the specific needs of their tasks. In this work, we propose a framework to automate the simultaneous selection and customization of the entire recognition process. This framework only requires an annotated set of sample images or videos and precisely specified task requirements to select an appropriate setup among thousands of possibilities. We use an efficient recognition infrastructure and iterative analysis strategies to make this approach practicable for real-world applications. A case study for face recognition from a single image per person demonstrates the capabilities of this holistic approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Stavens, D., Thrun, S.: Unsupervised Learning of Invariant Features using Video. In: CVPR (2010)
Google Scholar
Babenko, B., Dollár, P., Belongie, S.: Task Specific Local Region Matching. In: ICCV (2007)
Google Scholar
Winder, S., Hua, G., Brown, M.: Picking the best DAISY. In: CVPR (2009)
Google Scholar
Everingham, M., Van Gool, L., Williams, C., Winn, J., Zisserman, A.: The Pascal Visual Object Classes (VOC) Challenge. IJCV 88(2), 303–338 (2010)
Article Google Scholar
Tuytelaars, T., Mikolajczyk, K.: Local Invariant Feature Detectors: A Survey. Foundations and Trends in Computer Graphics and Vision 3(3), 177–280 (2008)
Article Google Scholar
Mikolajczyk, K., Schmid, C.: A Performance Evaluation of Local Descriptors. In: PAMI (2005)
Google Scholar
Hsu, C., Chang, C., Lin, D.: A Practical Guide to Support Vector Classification. Technical report, Nat. Taiwan University, Taipei (2003), http://www.csie.ntu.edu.tw/~cjlin/papers/guide/
Varma, M., Ray, D.: Learning the Discriminative Power-Invariance Trade-Off. In: ICCV (2007)
Google Scholar
Jiang, Y., Ngo, C., Yang, J.: Towards Optimal Bag-of-Features for Object Categorization and Semantic Video Retrieval. In: Int. Conf. Image and Video Retrieval (2007)
Google Scholar
Winder, S.A.J., Brown, M.: Learning Local Image Descriptors. In: CVPR (2007)
Google Scholar
Jahrer, M., Grabner, M., Bischof, H.: Learned Local Descriptors for Recognition and Matching. In: Computer Vision Winter Workshop (2008)
Google Scholar
Torralba, A., Russell, B.C., Yeun, J.: LabelMe: Online Image Annotation and Applications. Proceedings of the IEEE 98(8), 1467–1484 (2010)
Article Google Scholar
Doermann, D., Mihalcik, D.: Tools and Techniques for Video Performance Evaluation. In: ICPR, vol. 4, pp. 167-170 (2000)
Google Scholar
Leistner, C., Godec, M., Schulter, S., Saffari, A., Werlberger, M., Bischof, H.: Improving Classifiers with Unlabeled Weakly-Related Videos. In: CVPR (2011)
Google Scholar
Klemmer, S.R.: Papier-Mâché: Toolkit support for tangible interaction. In: Human Factors in Computing Systems (2004)
Google Scholar
Maynes-Aminzade, D., Winograd, T., Igarashi, T.: Eyepatch: Prototyping Camera-based Interaction through Examples. In: Symp. User Interface Software and Technology (2007)
Google Scholar
Muja, M., Rusu, R., Bradski, G., Lowe, D.: REIN - A Fast, Robust, Scalable REcognition INfrastructure. In: International Conference on Robotics and Automation (2011)
Google Scholar
Sorschag, R.: CORI: A Configurable Object Recognition Infrastructure. In: Int. Conf. on Signal and Image Processing Applications (2011)
Google Scholar
Bradski, G., Kaehler, A.: Learning OpenCV, Computer Vision with the Open Source Computer Vision Library. O’Reilly Press (2008), http://opencv.willowgarage.com
Lowe, D.: Distinctive Image Features from Scale-invariant Keypoints. IJCV (2004)
Google Scholar
Tan, S., Chen, S., Zhou, Z.-H., Zhang, F.: Face Recognition from a Single Image per Person: A Survey. Pattern Recognition 39, 1725–1745 (2006)
Article MATH Google Scholar
Viola, P., Jones, M.J.: Robust Real-time Face Detection. IJCV 57(2) (2004)
Google Scholar
Phillips, P.J., Wechsler, H., Huang, J., Rauss, P.J.: The FERET Database and Evaluation Procedure for Face Recognition Algorithms. In: Image and Vision Computing (1998)
Google Scholar
Frigo, M., Johnson, S.: The Design and Implementation of FFTW3. In: Proc. Program Generation, Optimization, and Platform Adaptation, vol. 93(2), pp. 216–231 (2005)
Google Scholar
Manjunath, B., Ohm, J.-R., Vasudevan, V., Yamada, A.: Color and Texture Descriptors. Trans. on Circuits and Systems for Video Technology 11, 703–715 (2001)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Software Technology and Interactive Systems, Vienna University of Technology, Favoritenstrasse 9-11, A-1040, Vienna, Austria
Robert Sorschag

Authors

Robert Sorschag
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Information Technology, Alpen-Adria-Universität Klagenfurt, Universitätsstr. 65-67, 9020, Klagenfurt, Austria
Klaus Schoeffmann
EURECOM, 2229 Rout des Crêtes, BP 193, 06904, Sophia Antipolis Cedex, France
Bernard Merialdo
School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave, 15213-3890, Pittsburgh, PA, USA
Alexander G. Hauptmann
Department of Computer Science, City University of Hong Kong, Tat Chee Ave, Kowloon, Hong Kong
Chong-Wah Ngo
Department of Electronic and Electrical Engineering, University College London, Roberts Building, Torrington Place, WC1E 7JE, London, UK
Yiannis Andreopoulos
Institute of Software Technology and Interactive Systems, Vienna University of Technology, Favoritenstrasse 9-11 188/2, 1040, Vienna, Austria
Christian Breiteneder

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sorschag, R. (2012). How to Select and Customize Object Recognition Approaches for an Application?. In: Schoeffmann, K., Merialdo, B., Hauptmann, A.G., Ngo, CW., Andreopoulos, Y., Breiteneder, C. (eds) Advances in Multimedia Modeling. MMM 2012. Lecture Notes in Computer Science, vol 7131. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27355-1_42

Download citation

DOI: https://doi.org/10.1007/978-3-642-27355-1_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27354-4
Online ISBN: 978-3-642-27355-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics