Scene Categorization with Class Extendibility and Effective Discriminative Ability

Lan, Zongyu; Su, Songzhi; Chen, Shu-Yuan; Li, Shaozi

doi:10.1007/978-3-642-22158-3_8

Zongyu Lan^6,7,
Songzhi Su^6,7,
Shu-Yuan Chen⁸ &
…
Shaozi Li^6,7

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 11))

748 Accesses
2 Citations

Abstract

Most of the numerous studies of scene categorization assume a fixed number of classes, and none categorize images with efficient class extendibility while preserving discriminative ability. This capability is crucial for an effective image categorization system. The proposed scene categorization method provides category-specific visual-word construction and image representation. The proposed method is effective for several reasons. First, since the visual-word construction and image representation are category-specific, image features related to the original classes need not be recreated when new classes are added, which minimizes reconstruction overhead. Second, since the visual-word construction and image representation are category-specific, the corresponding learning model for classification has substantial discriminating power. Experimental results confirm that the accuracy of the proposed method is superior to existing methods when using single-type and single-scale features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Quelhas, P., Monay, F., Odobez, J.M., Gatica-Perez, D., Tuytelaars, T., Van Gool, L.: Modeling scenes with local descriptors and latent aspects. In: Proc. IEEE Int. Conf. Computer Vision, pp. 883–890 (2005)
Google Scholar
Li, F.-F., Perona, P.: A Bayesian hierarchical model for learning natural scene categories. In: Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition, pp. 524–531 (2005)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Proc. IEEE Int. Conf. Computer Vision, pp. 2169–2178 (2006)
Google Scholar
Zhou, X., Zhuang, X., Tang, H., Hasegawa-Johnson, M., Huang, T.S.: Novel Gaussianized vector representation for improved natural scene categorization. Pattern Recognition Letters 31(8), 702–708 (2010)
Article Google Scholar
Bosch, A., Munoz, X., Marti, R.: Which is the best way to organize/classify images by content? Image Vision Computing 25(6), 778–791 (2007)
Article Google Scholar
Galleguillos, C., Belongie, S.: Context-based object categorization: A critical survey. Computer Vision and Image Understanding 114(1), 712–722 (2010)
Article Google Scholar
Hoang, N.V., Gouet-Brunet, V., Rukoz, M., Manouvrier, M.: Embedding spatial information into image content description for scene retrieval. Pattern Recognition 43(9), 3013–3024 (2008)
Article Google Scholar
Lu, Z., Ip, H.H.S.: Combining context, consistency, and diversity cues for interactive image categorization. IEEE Trans. Multimedia 12(3), 194–203 (2010)
Article Google Scholar
Qin, J., Yung, N.H.C.: Scene categorization via contextual visual words. Pattern Recognition 43(5), 1874–1888 (2010)
Article MATH Google Scholar
Van Gemert, J.C., Snoek, C.G.M., Veenman, C.J., Smeulders, A.W.M., Geusebroek, J.-M.: Comparing compact codebooks for visual categorization. Computer Vision and Image Understanding 114(4), 450–462 (2010)
Article Google Scholar
Van Gemert, J.C., Veenman, C.J., Smeulders, A.W.M., Geusebroek, J.-M.: Visual word ambiguity. IEEE Trans. Pattern Recognition and Machine Intelligence 32(7), 1271–71283 (2010)
Article Google Scholar
Wu, L., Hoi, S.C.H., Yu, N.: Semantics preserving bag-of-words models and applications. IEEE Trans. Image Processing 19(7), 1908–1920 (2010)
Article Google Scholar
Sun, Z.-L., Rajan, D., Chia, L.-T.: Scene Classification using multiple features in a two-stage probabilistic classification framework. Neurocomputing 73(16-18), 2971–2979 (2010)
Article Google Scholar
Abdullah, A., Veltkamp, R.C., Wiering, M.A.: Fixed partitioning salient points with MPEG-7 cluster correlograms for image categorization. Pattern Recognition 43(3), 650–662 (2010)
Article MATH Google Scholar
Bosch, A., Zisserman, A., Munoz, X.: Scene classification using a hybrid generative/discriminative approach. IEEE Trans. Pattern Recognition and Machine Intelligence 30(4), 712–727 (2008)
Article Google Scholar
Cheng, H., Wang, R.: Semantic modeling of natural scenes based on contextual Bayesian networks. Pattern Recognition 43(12), 4042–4054 (2010)
Article MATH Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Alpaydin, E.: Introduction to Machine Learning. MIT Press, Cambridge (2010)
MATH Google Scholar
Chang, C.-C., Lin, C.-J.: LIBSVM: A binary for support vector machine. Software available at, http://www.csie.ntu.edu.tw/cjlin/libsvm
Dalal, N., Triggs, B.: Histogram of oriented gradient for human detection. In: Proc. Computer Vision and Pattern Recognition, pp. 886–893 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Cognitive Science Department, Xiamen University, Xiamen, China
Zongyu Lan, Songzhi Su & Shaozi Li
Fujian Key Laboratory of the Brain-like Intelligent Systems, Xiamen University, China
Zongyu Lan, Songzhi Su & Shaozi Li
Department of Computer Science and Engineering, Yuan Ze University, Taiwan
Shu-Yuan Chen

Authors

Zongyu Lan
View author publications
You can also search for this author in PubMed Google Scholar
Songzhi Su
View author publications
You can also search for this author in PubMed Google Scholar
Shu-Yuan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Shaozi Li
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics, University of Piraeus, 80 Karaoli & Dimitriou St., 18534, Piraeus, Greece
George A. Tsihrintzis & Maria Virvou &
School of Electrical and Information Engineering, University of South Australia, Mawson Lakes Campus, SA 5095, Adelaide, South Australia, Australia
Lakhmi C. Jain
KES International, 2115, BN43 9AF, Shoreham-by-sea, West Sussex, United Kingdom
Robert J. Howlett

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lan, Z., Su, S., Chen, SY., Li, S. (2011). Scene Categorization with Class Extendibility and Effective Discriminative Ability. In: Tsihrintzis, G.A., Virvou, M., Jain, L.C., Howlett, R.J. (eds) Intelligent Interactive Multimedia Systems and Services. Smart Innovation, Systems and Technologies, vol 11. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22158-3_8

Download citation

DOI: https://doi.org/10.1007/978-3-642-22158-3_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22157-6
Online ISBN: 978-3-642-22158-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics