Abstract
While most current research in the classification domain still focuses on standard “flat” classification, there is an increasing interest in a particular type of structured classification called hierarchical classification. Incorporating knowledge about class hierarchy should be beneficial to computer vision systems as suggested by the fact that humans seem to organize objects into hierarchical structures based on visual geometrical similarities. In this paper, we analyze whether hierarchical classification provides better performance than flat classification by comparing three structured classification methods – Structured K-Nearest Neighbors, Structured Support Vector Machines and Maximum Margin Regression – with their flat counterparts on two very different computer vision tasks: facial expression recognition, for which we emphasize the underlying hierarchical structure, and 3D shape classification. The obtained results show no or only marginal improvement, which questions the way the data should be exploited for hierarchical classification in computer vision.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Astikainen, K., Holm, L., Pitkänen, E., Szedmak, S., Rousu, J.: Towards structured output prediction of enzyme function. BioMed. Central (2008)
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE PAMI 24(24), 509–522 (2002)
Crammer, K., Singer, Y.: On the algorithmic implementation of multiclass kernel-based vector machines. JMLR 2, 265–292 (2001)
Ekman, P., Rosenberg, E.L.: What the face reveals: Basic and applied studies of spontaneous expression using the Facial Action Coding System (FACS). Oxford University Press (1997)
Fei-Fei, L., Fergus, R., Perona, P.: One-shot learning of object categories. IEEE PAMI 28(4), 594–611 (2006)
Kiritchenko, S., Matwin, S., Famili, A.F.: Functional annotation of genes using hierarchical text categorization. In: BioLINK SIG: LLIKB (2005)
Lampert, C., Nickisch, H., Harmeling, S.: Attribute-based classification for zero-shot learning of object categories. IEEE PAMI (2013)
Lucey, P., Cohn, J.F., Kanade, T., Saragih, J., Ambadar, Z., Matthews, I.: The extended cohn-kanade dataset (ck+): A complete dataset for action unit and emotion-specified expression. In: CVPRW, pp. 94–101 (2010)
Rodríguez-Sánchez, A., Tsotsos, J.: The roles of endstopped and curvature tuned computations in a hierarchical representation of 2D shape. PLOS ONE 7(8), 1–13 (2012)
Rusu, R.B., Cousins, S.: 3D is here: Point cloud library (PCL). In: IEEE ICRA, pp. 1–4 (2011)
Rusu, R., Bradski, G., Thibaux, R., Hsu, J.: Fast 3D recognition and pose using the viewpoint feature histogram. In: IROS, pp. 2155–2162 (2010)
Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M.: Robust object recognition with cortex-like mechanisms. IEEE PAMI 29(3), 411–426 (2007)
Shilane, P., Min, P., Kazhdan, M., Funkhouser, T.: The princeton shape benchmark. In: Shape Modeling Applications, pp. 167–178 (2004)
Silla Jr., C.N., Freitas, A.A.: A survey of hierarchical classification across different application domains. DMKD 22(1–2), 31–72 (2011)
Tombari, F., Salti, S., Di Stefano, L.: Unique shape context for 3D data description. In: Workshop on 3D Object Retrieval, pp. 57–62. ACM (2010)
Tsochantaridis, I., Hofmann, T., Joachims, T., Altun, Y.: Support vector machine learning for interdependent and structured output spaces. In: ICML, p. 104. ACM (2004)
Wang, X., Liu, Y., Zha, H.: Intrinsic spin images: A subspace decomposition approach to understanding 3D deformable shapes. In: 3DPVT, vol. 10, pp. 17–20 (2010)
Weidenbacher, U., Neumann, H.: Extraction of surface-related features in a recurrent model of V1–V2 interactions. PLOS ONE 4(6), e5909 (2009)
Wohlkinger, W., Vincze, M.: Ensemble of shape functions for 3D object classification. In: IEEE ROBIO, pp. 2987–2992 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Hoyoux, T., Rodríguez-Sánchez, A.J., Piater, J.H., Szedmak, S. (2015). Can Computer Vision Problems Benefit from Structured Hierarchical Classification?. In: Azzopardi, G., Petkov, N. (eds) Computer Analysis of Images and Patterns. CAIP 2015. Lecture Notes in Computer Science(), vol 9257. Springer, Cham. https://doi.org/10.1007/978-3-319-23117-4_35
Download citation
DOI: https://doi.org/10.1007/978-3-319-23117-4_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23116-7
Online ISBN: 978-3-319-23117-4
eBook Packages: Computer ScienceComputer Science (R0)