Can Computer Vision Problems Benefit from Structured Hierarchical Classification?

Hoyoux, Thomas; Rodríguez-Sánchez, Antonio J.; Piater, Justus H.; Szedmak, Sandor

doi:10.1007/978-3-319-23117-4_35

Thomas Hoyoux¹⁵,
Antonio J. Rodríguez-Sánchez¹⁶,
Justus H. Piater¹⁶ &
…
Sandor Szedmak¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9257))

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

2694 Accesses
1 Citations

Abstract

While most current research in the classification domain still focuses on standard “flat” classification, there is an increasing interest in a particular type of structured classification called hierarchical classification. Incorporating knowledge about class hierarchy should be beneficial to computer vision systems as suggested by the fact that humans seem to organize objects into hierarchical structures based on visual geometrical similarities. In this paper, we analyze whether hierarchical classification provides better performance than flat classification by comparing three structured classification methods – Structured K-Nearest Neighbors, Structured Support Vector Machines and Maximum Margin Regression – with their flat counterparts on two very different computer vision tasks: facial expression recognition, for which we emphasize the underlying hierarchical structure, and 3D shape classification. The obtained results show no or only marginal improvement, which questions the way the data should be exploited for hierarchical classification in computer vision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Astikainen, K., Holm, L., Pitkänen, E., Szedmak, S., Rousu, J.: Towards structured output prediction of enzyme function. BioMed. Central (2008)
Google Scholar
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE PAMI 24(24), 509–522 (2002)
Article Google Scholar
Crammer, K., Singer, Y.: On the algorithmic implementation of multiclass kernel-based vector machines. JMLR 2, 265–292 (2001)
Google Scholar
Ekman, P., Rosenberg, E.L.: What the face reveals: Basic and applied studies of spontaneous expression using the Facial Action Coding System (FACS). Oxford University Press (1997)
Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: One-shot learning of object categories. IEEE PAMI 28(4), 594–611 (2006)
Article Google Scholar
Kiritchenko, S., Matwin, S., Famili, A.F.: Functional annotation of genes using hierarchical text categorization. In: BioLINK SIG: LLIKB (2005)
Google Scholar
Lampert, C., Nickisch, H., Harmeling, S.: Attribute-based classification for zero-shot learning of object categories. IEEE PAMI (2013)
Google Scholar
Lucey, P., Cohn, J.F., Kanade, T., Saragih, J., Ambadar, Z., Matthews, I.: The extended cohn-kanade dataset (ck+): A complete dataset for action unit and emotion-specified expression. In: CVPRW, pp. 94–101 (2010)
Google Scholar
Rodríguez-Sánchez, A., Tsotsos, J.: The roles of endstopped and curvature tuned computations in a hierarchical representation of 2D shape. PLOS ONE 7(8), 1–13 (2012)
Article Google Scholar
Rusu, R.B., Cousins, S.: 3D is here: Point cloud library (PCL). In: IEEE ICRA, pp. 1–4 (2011)
Google Scholar
Rusu, R., Bradski, G., Thibaux, R., Hsu, J.: Fast 3D recognition and pose using the viewpoint feature histogram. In: IROS, pp. 2155–2162 (2010)
Google Scholar
Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M.: Robust object recognition with cortex-like mechanisms. IEEE PAMI 29(3), 411–426 (2007)
Article Google Scholar
Shilane, P., Min, P., Kazhdan, M., Funkhouser, T.: The princeton shape benchmark. In: Shape Modeling Applications, pp. 167–178 (2004)
Google Scholar
Silla Jr., C.N., Freitas, A.A.: A survey of hierarchical classification across different application domains. DMKD 22(1–2), 31–72 (2011)
MATH MathSciNet Google Scholar
Tombari, F., Salti, S., Di Stefano, L.: Unique shape context for 3D data description. In: Workshop on 3D Object Retrieval, pp. 57–62. ACM (2010)
Google Scholar
Tsochantaridis, I., Hofmann, T., Joachims, T., Altun, Y.: Support vector machine learning for interdependent and structured output spaces. In: ICML, p. 104. ACM (2004)
Google Scholar
Wang, X., Liu, Y., Zha, H.: Intrinsic spin images: A subspace decomposition approach to understanding 3D deformable shapes. In: 3DPVT, vol. 10, pp. 17–20 (2010)
Google Scholar
Weidenbacher, U., Neumann, H.: Extraction of surface-related features in a recurrent model of V1–V2 interactions. PLOS ONE 4(6), e5909 (2009)
Google Scholar
Wohlkinger, W., Vincze, M.: Ensemble of shape functions for 3D object classification. In: IEEE ROBIO, pp. 2987–2992 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

INTELSIG, Montefiore Institute, University of Liège, Liège, Belgium
Thomas Hoyoux
Intelligent and Interactive Systems, Institute of Computer Science, University of Innsbruck, Innsbruck, Austria
Antonio J. Rodríguez-Sánchez, Justus H. Piater & Sandor Szedmak

Authors

Thomas Hoyoux
View author publications
You can also search for this author in PubMed Google Scholar
Antonio J. Rodríguez-Sánchez
View author publications
You can also search for this author in PubMed Google Scholar
Justus H. Piater
View author publications
You can also search for this author in PubMed Google Scholar
Sandor Szedmak
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Thomas Hoyoux .

Editor information

Editors and Affiliations

University of Malta, Msida, Malta
George Azzopardi
University of Groningen, Groningen, The Netherlands
Nicolai Petkov

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hoyoux, T., Rodríguez-Sánchez, A.J., Piater, J.H., Szedmak, S. (2015). Can Computer Vision Problems Benefit from Structured Hierarchical Classification?. In: Azzopardi, G., Petkov, N. (eds) Computer Analysis of Images and Patterns. CAIP 2015. Lecture Notes in Computer Science(), vol 9257. Springer, Cham. https://doi.org/10.1007/978-3-319-23117-4_35

Download citation

DOI: https://doi.org/10.1007/978-3-319-23117-4_35
Published: 26 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23116-7
Online ISBN: 978-3-319-23117-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics