A Multimodal Constellation Model for Object Category Recognition

Kamiya, Yasunori; Takahashi, Tomokazu; Ide, Ichiro; Murase, Hiroshi

doi:10.1007/978-3-540-92892-8_33

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5371)

Included in the following conference series:

International Conference on Multimedia Modeling

Abstract

Recognizing object categories whose appearance varies widely is one of the most challenging tasks in object recognition research. The dominant approach is the Bag of Features (BoF). The constellation model is an alternative with the following advantages: (a) candidate categories are easy to add or change; (b) its description accuracy is higher than that of BoF; (c) it can make effective use of position and scale information, which BoF ignores. However, the model has two weaknesses: (1) it is essentially a unimodal model, which makes it unsuitable for categories with many types of appearance; (2) the probability function that represents the constellation model takes a long time to compute. In this paper we propose a “Multimodal Constellation Model” that addresses both weaknesses. Experimental results show the effectiveness of the proposed model in comparison with BoF-based methods.
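
To illustrate weak point (1), the following is a minimal toy sketch, not the authors' model. It assumes, purely for illustration, that a "unimodal" model is a single Gaussian and a "multimodal" one is a two-component Gaussian mixture over a one-dimensional feature; the abstract does not specify the form of the proposed model. The data values, the function names, and the hand-set mixture parameters are all hypothetical.

```python
import math


def gaussian_logpdf(x, mean, var):
    """Log density of a 1-D Gaussian with the given mean and variance."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def fit_single_gaussian(xs):
    """Maximum-likelihood mean and variance of one Gaussian."""
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    return mean, var


def unimodal_loglik(xs, mean, var):
    """Total log-likelihood of the data under a single Gaussian."""
    return sum(gaussian_logpdf(x, mean, var) for x in xs)


def mixture_loglik(xs, weights, means, variances):
    """Total log-likelihood of the data under a Gaussian mixture."""
    total = 0.0
    for x in xs:
        p = sum(
            w * math.exp(gaussian_logpdf(x, m, v))
            for w, m, v in zip(weights, means, variances)
        )
        total += math.log(p)
    return total


# Hypothetical 1-D feature values for one category with two distinct
# appearance types (clusters near -2 and near +2).
xs = [-2.1, -1.9, -2.0, -2.2, -1.8, 2.0, 1.9, 2.1, 2.2, 1.8]

# Unimodal fit: one Gaussian has to cover both clusters at once.
mean, var = fit_single_gaussian(xs)
ll_uni = unimodal_loglik(xs, mean, var)

# Multimodal fit: parameters set by hand here (one component per cluster);
# in practice they would be estimated, for example with the EM algorithm.
ll_mix = mixture_loglik(xs, [0.5, 0.5], [-2.0, 2.0], [0.02, 0.02])

print("single Gaussian log-likelihood:", ll_uni)
print("two-component mixture log-likelihood:", ll_mix)
```

On this toy data the mixture attains a higher log-likelihood than the single Gaussian, because the single Gaussian must spread its variance across the empty region between the two clusters.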

Author information

Authors and Affiliations

Graduate School of Information Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, 464-8601, Japan
Yasunori Kamiya, Ichiro Ide & Hiroshi Murase
Faculty of Economics and Information, Gifu Shotoku Gakuen University, 1-38, Nakauzura, Gifu, 500-8288, Japan
Tomokazu Takahashi

Editor information

Editors and Affiliations

Eurécom, 2229, route des crêtes, 06904, Sophia-Antipolis, France
Benoit Huet
Dublin City University, Dublin, Ireland
Alan Smeaton
Department of Computer Science, University of North Carolina, Chapel Hill, NC, USA
Ketan Mayer-Patel
Image, Video and Multimedia Systems Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, 9 Iroon Polytechniou Str., 157 80, Athens, Greece
Yannis Avrithis

Cite this paper

Kamiya, Y., Takahashi, T., Ide, I., Murase, H. (2009). A Multimodal Constellation Model for Object Category Recognition. In: Huet, B., Smeaton, A., Mayer-Patel, K., Avrithis, Y. (eds) Advances in Multimedia Modeling. MMM 2009. Lecture Notes in Computer Science, vol 5371. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92892-8_33

DOI: https://doi.org/10.1007/978-3-540-92892-8_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92891-1
Online ISBN: 978-3-540-92892-8
eBook Packages: Computer Science, Computer Science (R0)