Object Category Recognition Using Generative Template Boosting

Peng, Shaowu; Lin, Liang; Porway, Jake; Sang, Nong; Zhu, Song-Chun

doi:10.1007/978-3-540-74198-5_16

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 4679)

Included in the following conference series:

International Workshop on Energy Minimization Methods in Computer Vision and Pattern Recognition

Abstract

In this paper, we present a framework for object categorization via sketch graphs, structures that incorporate both shape and structure information. In this framework, we integrate the learnable And-Or graph model, a hierarchical structure that combines the reconfigurability of a stochastic context-free grammar (SCFG) with the constraints of a Markov random field (MRF), and we sample object configurations from this generative model as training templates. Based on these synthesized templates, four steps of discriminative approaches are adopted for cascaded pruning, while a template matching method is developed for top-down verification. The synthesized templates are sampled from the whole configuration space following the maximum entropy constraints. In contrast to manually chosen data, they represent the variability of each object category well. The generalizability and flexibility of our framework are illustrated on 20 categories of sketch-based objects under different scales.
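
The template sampling described in the abstract can be illustrated with a minimal sketch. It covers only the grammar-like part of an And-Or graph: And-nodes expand into all of their children, and Or-nodes choose one alternative according to a branching probability. The toy graph, the node names, and the probabilities are assumptions made up for illustration; they do not come from the paper, and the MRF constraints between parts and the maximum entropy learning are left out entirely.

```python
import random

# Node kinds in the toy And-Or graph.
AND, OR, LEAF = "and", "or", "leaf"

# Hypothetical toy graph (illustrative only, not the paper's model).
# AND nodes list all required children; OR nodes list (child, probability)
# alternatives; LEAF nodes are terminal sketch parts.
GRAPH = {
    "clock": (AND, ["frame", "hands"]),
    "frame": (OR, [("round_frame", 0.7), ("square_frame", 0.3)]),
    "hands": (OR, [("two_hands", 0.6), ("three_hands", 0.4)]),
    "round_frame": (LEAF, []),
    "square_frame": (LEAF, []),
    "two_hands": (LEAF, []),
    "three_hands": (LEAF, []),
}


def sample_template(node, rng):
    """Sample one configuration (a list of leaf parts) rooted at `node`."""
    kind, children = GRAPH[node]
    if kind == LEAF:
        return [node]
    if kind == AND:
        # An And-node is a composition: expand every child in order.
        parts = []
        for child in children:
            parts.extend(sample_template(child, rng))
        return parts
    # An Or-node is a switch: pick one alternative by its probability.
    names = [name for name, _ in children]
    weights = [weight for _, weight in children]
    choice = rng.choices(names, weights=weights, k=1)[0]
    return sample_template(choice, rng)


if __name__ == "__main__":
    rng = random.Random(0)
    for _ in range(5):
        print(sample_template("clock", rng))
```

Each sampled list plays the role of one synthesized template. In the framework the abstract describes, such templates would then feed the discriminative pruning steps and the top-down template matching, neither of which is sketched here.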

Author information

Authors and Affiliations

IPRAI, Huazhong University of Science and Technology, Wuhan, 430074, P.R. China
Shaowu Peng & Nong Sang
School of Information Science and Technology, Beijing Institute of Technology, Beijing, 100081, P.R. China
Liang Lin
Lotus Hill Institute for Computer Vision and Information Science, Ezhou, 436000, P.R. China
Shaowu Peng, Liang Lin, Nong Sang & Song-Chun Zhu
Departments of Statistics, University of California, Los Angeles, Los Angeles, California, 90095, USA
Jake Porway & Song-Chun Zhu

Editor information

Alan L. Yuille, Song-Chun Zhu, Daniel Cremers, Yongtian Wang

Cite this paper

Peng, S., Lin, L., Porway, J., Sang, N., Zhu, SC. (2007). Object Category Recognition Using Generative Template Boosting. In: Yuille, A.L., Zhu, SC., Cremers, D., Wang, Y. (eds) Energy Minimization Methods in Computer Vision and Pattern Recognition. EMMCVPR 2007. Lecture Notes in Computer Science, vol 4679. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74198-5_16

DOI: https://doi.org/10.1007/978-3-540-74198-5_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74195-4
Online ISBN: 978-3-540-74198-5
eBook Packages: Computer Science, Computer Science (R0)
