Abstract
Automatic image scene detection is a crucial step for various tasks in computer vision. Current scene detection methods are often computationally expensive for use in real-time image classification. In this paper, a novel and efficient scene detection method based on local invariant features is presented. First, the SIFT feature detector and descriptor has been utilized to extract local image features since the SIFT descriptor has been proved to be an excellent local method that yields high quality features. However, the SIFT descriptor has been shown to produce high dimensional and redundant local features, which can create processing difficulty and computational burden in the successive classification stage. Therefore, two new feature selection strategies are proposed to reduce the number of SIFT keypoints and hence reduce the computational complexity. In both strategies, each image is represented by a single feature vector which assures the efficiency. Finally, a multi-classifier based on a support vector machine is applied to perform the scene detection task. Experimental results show that the proposed method can achieve accurate satisfactory classification results with significantly reduced computational complexity.
Similar content being viewed by others
References
Bosch A, Zisserman A, Munoz X (2006) Scene classification via pLSA. Computer Vision–ECCV 2006:517–530
Burt P, Adelson E (1983) The Laplacian pyramid as a compact image code. IEEE Trans Commun 31(4):532–540
Chen Y, Wang JZ (2004) Image categorization by learning and reasoning with regions. J Mach Learn Res 5:913–939
Fei-Fei L, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005. IEEE, pp 524–531
Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42(1):177–196
Horster E, Lienhart R (2007) Fusing local image descriptors for large-scale image retrieval. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR’07, 2007. IEEE, pp 1–8
Jin B, Hu W, Wang H (2012) Image classification based on pLSA fusing spatial relationships between Topics. Signal Process Lett, IEEE 19(3):151–154
Keller M, Bengio S (2004) Theme-topic mixture model for document representation. Learning Methods for Text Understanding and Mining
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2006. IEEE, pp 2169–2178
Li J, Wang JZ (2003) Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans Pattern Anal Mach Intell 25(9):1075–1088
Lowe DG (1999) Object recognition from local scale-invariant features. In: The Proceedings of the Seventh IEEE International Conference on Computer Vision, 1999. IEEE, pp 1150–1157
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Murphy K, Torralba A, Freeman W (2003) Using the forest to see the trees: a graphical model relating features, objects and scenes. Adv Neural Inf Process Syst 16
Qin J, Yung NHC (2012) Feature fusion within local region using localized maximum-margin learning for scene categorization. Pattern Recognit 45(4):1671–1683. doi:10.1016/j.patcog.2011.09.027
Quelhas P, Monay F, Odobez JM, Gatica-Perez D, Tuytelaars T (2007) A thousand words in a scene. IEEE Trans Pattern Anal Mach Intell 29(9):1575–1589
Rasiwasia N, Vasconcelos N (2008) Scene classification with low-dimensional semantic spaces and weak supervision. In: IEEE Conference on Computer Vision and Pattern Recognition CVPR 2008, 2008. IEEE, pp 1–6
Szummer M, Picard RW (1998) Indoor-outdoor image classification. In: IEEE International Workshop on Content-Based Access of Image and Video Database, 1998. IEEE, pp 42–51
Vailaya A, Figueiredo MAT, Jain AK, Zhang HJ (2001) Image classification for content-based indexing. IEEE Trans Image Process 10(1):117–130
Varma M, Ray D (2007) Learning the discriminative power-invariance trade-off. In: IEEE 11th International Conference on Computer Vision, ICCV 2007, 2007. IEEE, pp 1–8
Wallraven C, Caputo B, Graf A (2003) Recognition with local features: the kernel recipe. In: Ninth IEEE International Conference on Computer Vision 2003. IEEE, pp 257–264
Wan HL, Chowdhury M (2003) Image semantic classification by using SVM. J Softw 14(11):1891–1899
Yang J, Jiang YG, Hauptmann AG, Ngo CW (2007) Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of the International Workshop on Workshop on multimedia information retrieval, 2007. ACM, pp 197–206
Zeng P, Wen J, Wu L (2007) Scene classification using low-level feature and intermediate feature. In: Proceedings of SPIE Fifth International Symposium on Multispectral Image Processing and Pattern Recognition, MIPPR2007, 2007. pp 1–7
Acknowledgments
This work has been supported by the National Natural Science Foundation of China under contract No. 61201337 and by Changsha Municipal Science and Technology Project under contract No. K1205045-11. The authors are grateful to the anonymous reviewers for valuable comments.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Xie, Y., Zhang, XP., Luan, X. et al. A novel specific image scenes detection method. Multimed Tools Appl 74, 105–122 (2015). https://doi.org/10.1007/s11042-013-1496-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-013-1496-7