Context-Based Object Recognition: Indoor Versus Outdoor Environments

Alameer, Ali; Degenaar, Patrick; Nazarpour, Kianoush

doi:10.1007/978-3-030-17798-0_38

Ali Alameer^16,17,
Patrick Degenaar^16,18 &
Kianoush Nazarpour^16,18

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 944))

Included in the following conference series:

Science and Information Conference

2286 Accesses
4 Citations

Abstract

Object recognition is a challenging problem in high-level vision. Models that perform well for the outdoor domain, perform poorly in the indoor domain and the reverse is also true. This is due to the dramatic discrepancies of the global properties of each environment, for instance, backgrounds and lighting conditions. Here, we show that inferring the environment before or during the recognition process can dramatically enhance the recognition performance. We used a combination of deep and shallow models for object and scene recognition, respectively. Also, we used three novel topologies that can provide a trade-off between classification accuracy and decision sensitivity. We achieved a classification accuracy of 97.91%, outperforming the performance of a single GoogLeNet by 13%. In another experiment, we achieved an accuracy of 95% to categorise indoor and outdoor scenes by inference.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Li, Z., Wang, Y., Yu, J., Guo, Y., Cao, W.: Deep learning based radiomics (DLR) and its usage in noninvasive IDH1 prediction for low grade glioma. Sci. Rep. 7(11), 5467 (2017)
Article Google Scholar
Hu, X., Zhang, J., Li, J., Zhang, B.: Sparsity-regularized hmax for visual recognition. PloS One 9(1), 215–243 (2014)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 675–678 (2014)
Google Scholar
Ghazaei, G., Alameer, A., Degenaar, P., Morgan, G., Nazarpour, K.: Deep learning-based artificial vision for grasp classification in myoelectric hands. J. Neural Eng. 14(3), 036025 (2017)
Article Google Scholar
Abolghasemi, V., Chen, M., Alameer, A., Ferdowsi, S., Chambers, J., Nazarpour, K.: Incoherent dictionary pair learning: application to a novel open-source database of chinese numbers. IEEE Sig. Process. Lett. 25(4), 472–476 (2018)
Article Google Scholar
Ghazaei, G., Alameer, A., Degenaar, P., Morgan, G., Nazarpour, K.: An exploratory study on the use of convolutional neural networks for object grasp classification. In: Proceedings of the 2nd IET International Conference on Processing Intelligent Signal Processing (ISP), pp. 5–8 (2015)
Google Scholar
Riesenhuber, M., Poggio, T.: Hierarchical models of object recognition in cortex. Nat. Neurosci. 2(11), 1019–1025 (1999)
Article Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems 25, pp. 1097–1105 (2012)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, vol. 9, no. 1 (2014)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
Google Scholar
Quattoni, A., Torralba, A.: Recognizing indoor scenes. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 413–420 (2009)
Google Scholar
Alameer, A., Degenaar, P., Nazarpour, K.: Biologically-inspired object recognition system for recognizing natural scene categories. In: International Conference for Students on Applied Engineering (ICSAE), pp. 129–132. IEEE (2016)
Google Scholar
Hansen, L.K., Salamon, P.: Neural network ensembles. IEEE Trans. Pattern Anal. Mach. Intell. 12(10), 993–1001 (1990)
Article Google Scholar
Xu, L., Krzyzak, A., Suen, C.Y.: Methods of combining multiple classifiers and their applications to handwriting recognition. IEEE Trans. Syst. Man Cybern. 22(3), 418–435 (1992)
Article Google Scholar
Tumer, K., Ghosh, J.: Analysis of decision boundaries in linearly combined neural classifiers. Pattern Recogn. 29(2), 341–348 (1996)
Article Google Scholar
Ho, T.K., Hull, J.J., Srihari, S.N.: Decision combination in multiple classifier systems. IEEE Trans. Pattern Anal. Mach. Intell. 16(1), 66–75 (1994)
Article Google Scholar
Serre, T., Oliva, A., Poggio, T.: A feedforward architecture accounts for rapid categorization. Proc. Natl. Acad. Sci. 104(15), 6424–6429 (2007)
Article Google Scholar
Hubel, D.H., Wiesel, T.N.: Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J. Physiol. 160(1), 106–154 (1962)
Article Google Scholar
Alameer, A., Ghazaei, G., Degenaar, P., Nazarpour, K.: An elastic net-regularized HMAX model of visual processing. In: Proceedings of the 2nd IET International Conference on Processing Intelligent Signal Processing (ISP), pp. 1–4 (2015)
Google Scholar
Alameer, A., Ghazaei, G., Degenaar, P., Chambers, J.A., Nazarpour, K.: Object recognition with an elastic net-regularized hierarchical MAX model of the visual cortex. IEEE Sig. Process. Lett. 23(8), 1062–1066 (2016)
Article Google Scholar
Alameer, A., Degenaar, P., Nazarpour, K.: Processing occlusions using elastic-net hierarchical max model of the visual cortex. In: IEEE International Conference on Innovations in Intelligent SysTems and Applications (INISTA), pp. 163–167. IEEE (2017)
Google Scholar
Shen, B., Liu, B.-D., Wang, Q.: Elastic net regularized dictionary learning for image classification. Multimedia Tools Appl. 75, 1–14 (2014)
Google Scholar
Hyvärinen, A., Gutmann, M., Hoyer, P.O.: Statistical model of natural stimuli predicts edge-like pooling of spatial frequency channels in V2. BMC Neurosci. 6(1), 12 (2005)
Article Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 248–255 (2009)
Google Scholar
Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., Oliva, A.: Learning deep features for scene recognition using places database. In: Advances in Neural Information Processing Systems, pp. 487–495 (2014)
Google Scholar
Lin, M., Chen, Q., Yan, S.: Network in network. arXiv preprint arXiv:1312.4400, vol. 6, no. 11, pp. 1019–1025 (2013)
Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)
Article Google Scholar
Alameer, A., Akkar, H.A.: ECG signal diagnoses using intelligent systems based on FPGA. Eng. Technol. J. 31(7), 1351–1364 (2013). Part (A) Engineering
Google Scholar
Ronquist, F., Huelsenbeck, J.P.: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19(12), 1572–1574 (2003)
Article Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. Comput. Vis. Image Underst. 106(1), 59–70 (2007)
Article Google Scholar
Griffin, G., Holub, A., Perona, P.: Caltech-256 object category dataset (2007)
Google Scholar
Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
MATH Google Scholar
Joubert, O.R., Rousselet, G.A., Fize, D., Fabre-Thorpe, M.: Processing scene context: fast categorization and object interference. Vis. Res. 47(26), 3286–3297 (2007)
Article Google Scholar

Download references

Acknowledgments

The work of A. Alameer was supported by the Higher Committee for Education Development, Iraq (HCED, D1201017). The work of K. Nazarpour was supported by the Engineering and Physical Sciences Research Council, U.K., grants EP/M025977/1 and EP/M025594/1.

Author information

Authors and Affiliations

School of Engineering, Newcastle University, Newcastle, NE1 7RU, UK
Ali Alameer, Patrick Degenaar & Kianoush Nazarpour
School of Natural and Environmental Sciences, Newcastle University, Newcastle Upon Tyne, NE1 7RU, UK
Ali Alameer
Institute of Neuroscience, Newcastle University, Newcastle, NE2 4HH, UK
Patrick Degenaar & Kianoush Nazarpour

Authors

Ali Alameer
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Degenaar
View author publications
You can also search for this author in PubMed Google Scholar
Kianoush Nazarpour
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Ali Alameer or Kianoush Nazarpour .

Editor information

Editors and Affiliations

Saga University, Saga, Saga, Japan
Kohei Arai
The Science and Information (SAI) Organization, Bradford, West Yorkshire, UK
Supriya Kapoor

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Alameer, A., Degenaar, P., Nazarpour, K. (2020). Context-Based Object Recognition: Indoor Versus Outdoor Environments. In: Arai, K., Kapoor, S. (eds) Advances in Computer Vision. CVC 2019. Advances in Intelligent Systems and Computing, vol 944. Springer, Cham. https://doi.org/10.1007/978-3-030-17798-0_38

Download citation

DOI: https://doi.org/10.1007/978-3-030-17798-0_38
Published: 24 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-17797-3
Online ISBN: 978-3-030-17798-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics