Class-Specific Low-Dimensional Representation of Local Features for Viewpoint Invariant Object Recognition

Raytchev, Bisser; Kikutsugi, Yuta; Tamaki, Toru; Kaneda, Kazufumi

doi:10.1007/978-3-642-19318-7_20

Bisser Raytchev¹⁹,
Yuta Kikutsugi¹⁹,
Toru Tamaki¹⁹ &
…
Kazufumi Kaneda¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6494))

Included in the following conference series:

Asian Conference on Computer Vision

2873 Accesses

Abstract

In this paper we propose a new general framework to obtain more distinctive local invariant features by projecting the original feature descriptors into low–dimensional feature space, while simultaneously incorporating also class information. In the resulting feature space, the features from different objects project to separate areas, while locally the metric relations between features corresponding to the same object are preserved. The low–dimensional feature embedding is obtained by a modified version of classical Multidimensional Scaling, which we call supervised Multidimensional Scaling (sMDS). Experimental results on a database containing images of several different objects with large variation in scale, viewpoint, illumination conditions and background clutter support the view that embedding class information into the feature representation is beneficial and results in more accurate object recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 12(60), 91–110 (2004)
Article Google Scholar
Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: SURF: Speeded Up Robust Features. Computer Vision and Image Understanding (CVIU) 110(3), 346–359 (2008)
Article Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, pp. 20–25 (2005)
Google Scholar
Tuytelaars, T., Mikolajczyk, K.: Local Invariant Feature Detectors: A Survey. Foundations and Trends in Computer Graphics and Vision 3(3), 177–280 (2007)
Article Google Scholar
Murase, H., Nayar, S.: Visual Learning and Recognition of 3-D Objects from Appearance. International Journal of Computer Vision 14(1), 5–24 (1995)
Article Google Scholar
Csurka, G., Bray, C., Dance, C., Fan, L.: Visual Categorization with bags of keypoints. In: Proc. ECCV Workshop on Statistical Learning in Computer Vision, pp. 1–22 (2004)
Google Scholar
Sivic, J., Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: Proceedings of the International Conference on Computer Vision, pp. 1470–1477 (2003)
Google Scholar
Cox, T., Cox, M.: Multidimensional Scaling, 2nd edn. Chapman and Hall, Boca Raton (2000)
MATH Google Scholar
Jolliffe, I.: Principal Component Analysis. Springer, Heidelberg (1986)
Book MATH Google Scholar
Ke, Y., Sukthankar, R.: PCA–SIFT: A More Distinctive Representation for Local Image Descriptors. In: Proceedings IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 506–513 (2004)
Google Scholar
Grauman, K., Darrell, T.: The pyramid match kernel: discriminative classification with sets of image features. In: Proc. IEEE Int. Conf. Computer Vision, vol. 2, pp. 1458–1465 (2005)
Google Scholar
van Gemert, J.C., Geusebroek, J.-M., Veenman, C.J., Smeulders, A.W.M.: Kernel codebooks for scene categorization. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 696–709. Springer, Heidelberg (2008)
Chapter Google Scholar
Hua, G., Brown, M., Winder, S.: Discriminant embedding for local image descriptors. In: Proc. IEEE Int. Conf. Computer Vision, pp. 1–8 (2007)
Google Scholar
Belhumeur, P.N., Hespanha, J.P., Kriegman, D.J.: Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection. IEEE Trans. PAMI 19(7), 711–720 (1997)
Article Google Scholar
Torgeson, W.: Multidimensional Scaling: I. Theory and method. Psychometrika 17, 401–419 (1952)
Article MathSciNet Google Scholar
Mardia, K., Kent, J., Bibby, J.: Multivariate Analysis. Academic Press, London (1979)
MATH Google Scholar
Wandell, B., Brewer, A.A., Dougherty, R.F.: Visual Field Map Clusters in Human Cortex. Phil. Trans. of the Royal Society London 360, 693–707 (2005)
Article Google Scholar
Gower, J.: Adding a point to vector diagrams in multivariate analysis. Biometrica 55, 582–585 (1968)
Article MATH Google Scholar
http://www.cs.cmu.edu/~yke/pcasift/
Yan, S., Xu, D., Zhang, B., Zhang, H., Yang, Q.: Graph Embedding and Extensions: A General Framework for Dimensionality Rediction. IEEE Trans. PAMI 29(1), 40–51 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Engineering, Hiroshima University, Japan
Bisser Raytchev, Yuta Kikutsugi, Toru Tamaki & Kazufumi Kaneda

Authors

Bisser Raytchev
View author publications
You can also search for this author in PubMed Google Scholar
Yuta Kikutsugi
View author publications
You can also search for this author in PubMed Google Scholar
Toru Tamaki
View author publications
You can also search for this author in PubMed Google Scholar
Kazufumi Kaneda
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Technion – Israel Institute of Technology, Department of Computer Science, 32000, Haifa, Israel
Ron Kimmel
The University of Auckland, 37 Kohimarama Road , Mission Bay, 1071, Auckland, New Zealand
Reinhard Klette
National Institute of Informatics, Chiyoda, 1018430, Tokyo, Japan
Akihiro Sugimoto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Raytchev, B., Kikutsugi, Y., Tamaki, T., Kaneda, K. (2011). Class-Specific Low-Dimensional Representation of Local Features for Viewpoint Invariant Object Recognition. In: Kimmel, R., Klette, R., Sugimoto, A. (eds) Computer Vision – ACCV 2010. ACCV 2010. Lecture Notes in Computer Science, vol 6494. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19318-7_20

Download citation

DOI: https://doi.org/10.1007/978-3-642-19318-7_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19317-0
Online ISBN: 978-3-642-19318-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics