Abstract
Most of the numerous studies of scene categorization assume a fixed number of classes, and none categorize images with efficient class extendibility while preserving discriminative ability. This capability is crucial for an effective image categorization system. The proposed scene categorization method provides category-specific visual-word construction and image representation. The proposed method is effective for several reasons. First, since the visual-word construction and image representation are category-specific, image features related to the original classes need not be recreated when new classes are added, which minimizes reconstruction overhead. Second, since the visual-word construction and image representation are category-specific, the corresponding learning model for classification has substantial discriminating power. Experimental results confirm that the accuracy of the proposed method is superior to existing methods when using single-type and single-scale features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Quelhas, P., Monay, F., Odobez, J.M., Gatica-Perez, D., Tuytelaars, T., Van Gool, L.: Modeling scenes with local descriptors and latent aspects. In: Proc. IEEE Int. Conf. Computer Vision, pp. 883–890 (2005)
Li, F.-F., Perona, P.: A Bayesian hierarchical model for learning natural scene categories. In: Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition, pp. 524–531 (2005)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Proc. IEEE Int. Conf. Computer Vision, pp. 2169–2178 (2006)
Zhou, X., Zhuang, X., Tang, H., Hasegawa-Johnson, M., Huang, T.S.: Novel Gaussianized vector representation for improved natural scene categorization. Pattern Recognition Letters 31(8), 702–708 (2010)
Bosch, A., Munoz, X., Marti, R.: Which is the best way to organize/classify images by content? Image Vision Computing 25(6), 778–791 (2007)
Galleguillos, C., Belongie, S.: Context-based object categorization: A critical survey. Computer Vision and Image Understanding 114(1), 712–722 (2010)
Hoang, N.V., Gouet-Brunet, V., Rukoz, M., Manouvrier, M.: Embedding spatial information into image content description for scene retrieval. Pattern Recognition 43(9), 3013–3024 (2008)
Lu, Z., Ip, H.H.S.: Combining context, consistency, and diversity cues for interactive image categorization. IEEE Trans. Multimedia 12(3), 194–203 (2010)
Qin, J., Yung, N.H.C.: Scene categorization via contextual visual words. Pattern Recognition 43(5), 1874–1888 (2010)
Van Gemert, J.C., Snoek, C.G.M., Veenman, C.J., Smeulders, A.W.M., Geusebroek, J.-M.: Comparing compact codebooks for visual categorization. Computer Vision and Image Understanding 114(4), 450–462 (2010)
Van Gemert, J.C., Veenman, C.J., Smeulders, A.W.M., Geusebroek, J.-M.: Visual word ambiguity. IEEE Trans. Pattern Recognition and Machine Intelligence 32(7), 1271–71283 (2010)
Wu, L., Hoi, S.C.H., Yu, N.: Semantics preserving bag-of-words models and applications. IEEE Trans. Image Processing 19(7), 1908–1920 (2010)
Sun, Z.-L., Rajan, D., Chia, L.-T.: Scene Classification using multiple features in a two-stage probabilistic classification framework. Neurocomputing 73(16-18), 2971–2979 (2010)
Abdullah, A., Veltkamp, R.C., Wiering, M.A.: Fixed partitioning salient points with MPEG-7 cluster correlograms for image categorization. Pattern Recognition 43(3), 650–662 (2010)
Bosch, A., Zisserman, A., Munoz, X.: Scene classification using a hybrid generative/discriminative approach. IEEE Trans. Pattern Recognition and Machine Intelligence 30(4), 712–727 (2008)
Cheng, H., Wang, R.: Semantic modeling of natural scenes based on contextual Bayesian networks. Pattern Recognition 43(12), 4042–4054 (2010)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Computer Vision 60(2), 91–110 (2004)
Alpaydin, E.: Introduction to Machine Learning. MIT Press, Cambridge (2010)
Chang, C.-C., Lin, C.-J.: LIBSVM: A binary for support vector machine. Software available at, http://www.csie.ntu.edu.tw/cjlin/libsvm
Dalal, N., Triggs, B.: Histogram of oriented gradient for human detection. In: Proc. Computer Vision and Pattern Recognition, pp. 886–893 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lan, Z., Su, S., Chen, SY., Li, S. (2011). Scene Categorization with Class Extendibility and Effective Discriminative Ability. In: Tsihrintzis, G.A., Virvou, M., Jain, L.C., Howlett, R.J. (eds) Intelligent Interactive Multimedia Systems and Services. Smart Innovation, Systems and Technologies, vol 11. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22158-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-22158-3_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22157-6
Online ISBN: 978-3-642-22158-3
eBook Packages: EngineeringEngineering (R0)