Abstract
A novel bionic, parallel, and hierarchical scene classification framework is presented in this paper. Moreover, we build the model based on the perception as defined by the human visual system. At first, we use an image pyramid to present both the global scene and local patches containing specific objects. Second, we build our own codebooks, which satisfy both long stare and short saccade similar to humans. Next, we train the visual words by generative and discriminative methods, respectively, which could obtain the initial scene categories based on the potential semantics using the bag-of-words model. Then, we use a neural network to simulate a human decision process. This leads to the final scene category. Experiments show that the parallel, hierarchical image representation, and classification model obtain superior results with respect to accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: IEEE conference on computer vision and pattern recognition, pp 1–8
Malisiewicz T, Efros AA (2008) Recognition by association via learning per-exemplar distances. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8
Fei–Fei L, Perona P (2005) A Bayesian hierarchical model for learning natural scene categories, In: IEEE conference on computer vision and pattern recognition, pp 524–531
Talter BW (2009) Current understanding of eye guidance. Vis Cognition 17:777–789
Talter BW, Vincent BT (2008) Systematic tendencies in scene viewing. J Eye Mov Res 2:1–18
Talter BW, Vincent BT (2009) The prominence of behavioural biases in eye guidance. Vis Cognition 17:1029–1054
Grauman K, Darrell T (2005) Pyramid match kernels: discriminative classification with sets of image features. In: Proceedings of the international conference on computer vision, pp 1–8
Lowe DG (1999) Object recognition from local scale-invariant features. In: International conference on computer vision, Corfu, Greece, pp 1150–1157
Hofmann T (1998) Probabilistic latent semantic indexing. In: Proceedings of the SIGIR conference research and development in information retrieval, pp 1–13
Torralba A (2003) Contextual priming for object detection. Int J Comput Vision 53(2):169–191
Acknowledgments
This work was financially supported by the Chinese People’s Public Security University Natural Science Foundation (2011LG08).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Feng, W., Zhou, X. (2014). Research on HVS-Inspired, Parallel, and Hierarchical Scene Classification Framework. In: Sun, F., Hu, D., Liu, H. (eds) Foundations and Practical Applications of Cognitive Systems and Information Processing. Advances in Intelligent Systems and Computing, vol 215. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37835-5_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-37835-5_21
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37834-8
Online ISBN: 978-3-642-37835-5
eBook Packages: EngineeringEngineering (R0)