Advertisement

RNN-LSTM Based Indoor Scene Classification with HoG Features

  • Ambica VermaEmail author
  • Shilpa Sharma
  • Priyanka Gupta
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 955)

Abstract

The machine learning and artificial intelligence models have evolved with unenvisaged swiftness over the past decade. The machine learning and artificial intelligence models work on the basis of the mathematical models, which are capable of processing the financial, image, video, audio, audio-visual and several other forms of data. In this paper, the work has been carried upon the bulk image data representing the variety of indoor scenes. The goal is to detect the type of indoor scene, which can be utilized by variety of artificial intelligent application for various purposes. The indoor scene recognition has been performed over the computer vision & pattern recognition (CVPR), 2009 dataset, which is consisted of 15620 images. The deep learning mechanism known as recurrent neural network (RNN) has been incorporated for the classification of the indoor scene data over the histogram of oriented gradient (HoG) based features. Specifically, the forget gates based recurrent mode called Long short-term memory (LSTM) has been incorporated for the classification of the indoor scene data. The performance of the proposed model has been analyzed over the CVPR09 dataset using 50 and 100 randomly drawn test cases. The proposed model has been found 92% accurate in comparison with SVM (88%), KNN (80%) and Naïve Bayes (86%). The F1-measure based performance assessment also proves the robustness of LSTM based model with 96% accuracy over 92% of Naïve Bayes, 93% of SVM and 88% of KNN.

Keywords

Deep learning Recurrent neural network Long-short term memory LSTM Indoor scene recognition CVPR’09 

References

  1. 1.
    Espinace, P., Kollar, T., Roy, N., Soto, A.: Indoor scene recognition by a mobile robot through adaptive object detection. Robot. Auton.Syst. 61(9), 932–947 (2013)CrossRefGoogle Scholar
  2. 2.
    Giannoulis, D., Stowell, D., Benetos, E., Rossignol, M., Lagrange, M., Plumbley, M.D.: A database and challenge for acoustic scene classification and event detection. In: 2013 Proceedings of the 21st European Signal Processing Conference (EUSIPCO), pp. 1–5. IEEE (2013)Google Scholar
  3. 3.
    Antanas, L., Hoffmann, M.E., Frasconi, P., Tuytelaars, T., De Raedt, L.: A relational kernel-based approach to scene classification. In: 2013 IEEE Workshop on Applications of Computer Vision (WACV), pp. 133–139. IEEE (2013)Google Scholar
  4. 4.
    Gupta, S., Arbelaez, P., Malik, J.: Perceptual organization and recognition of indoor scenes from RGB-D images. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 564–571. IEEE (2013)Google Scholar
  5. 5.
    Juneja, M., Andrea, V., Jawahar, C.V., Zisserman, A.: Blocks that shout: distinctive parts for scene classification. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 923–930. IEEE (2013)Google Scholar
  6. 6.
    Monadjemi, A., Thomas, B.T., Mirmehdi, M.: Experiments on high resolution images towards outdoor scene classification. Technical report, University of Bristol, Department of Computer Science (2002)Google Scholar
  7. 7.
    Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. John Wiley and Sons, Hoboken (1999)zbMATHGoogle Scholar
  8. 8.
    Fitzpatrick, P.: Indoor/outdoor scene classification project. Pattern Recognition and AnalysisGoogle Scholar
  9. 9.
    Quattoni, A., Torralba, A.: Recognizing indoor scenes. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition CVPR 2009, pp. 413–420. IEEE (2009)Google Scholar
  10. 10.
    Li, L.-J., Su, H., Lim, Y., Fei-Fei, L.: Objects as attributes for scene classification. In: Kutulakos, K.N. (ed.) ECCV 2010. LNCS, vol. 6553, pp. 57–69. Springer, Heidelberg (2012).  https://doi.org/10.1007/978-3-642-35749-7_5CrossRefGoogle Scholar
  11. 11.
    Antanas, L., Hoffmann, M., Frasconi, P., Tuytelaars, T., De Raedt, L.: A relational kernel-based approach to scene classification. In: 2013 IEEE Workshop on Applications of Computer Vision (WACV), pp. 133–139. IEEE (2013)Google Scholar
  12. 12.
    Mesnil, G., Rifai, S., Bordes, A., Glorot, X., Bengio, Y., Vincent, P.: Unsupervised and transfer learning under uncertainty-from object detections to scene categorization. In: ICPRAM, pp. 345–354 (2013)Google Scholar
  13. 13.
    Zhang, L., Zhen, X., Shao, L.: Learning object-to-class kernels for scene classification. IEEE Trans. Image Process. 23(8), 3241–3253 (2014)MathSciNetCrossRefGoogle Scholar
  14. 14.
    Li, L.-J., Su, H., Fei-Fei, L., Xing, X.P.: Object bank: a high-level image representation for scene classification and semantic feature sparsification. In: Advances in Neural Information Processing Systems, pp. 1378–1386 (2010)Google Scholar
  15. 15.
    Alberti, M., Folkesson, J., Jensfelt, P.: Relational approaches for joint object classification and scene similarity measurement in indoor environments. In: AAAI 2014 Spring Symposia: Qualitative Representations for Robots (2014)Google Scholar
  16. 16.
    Russakovsky, O., Lin, Y., Yu, K., Fei-Fei, L.: Object-centric spatial pooling for image classification. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, pp. 1–15. Springer, Heidelberg (2012).  https://doi.org/10.1007/978-3-642-33709-3_1CrossRefGoogle Scholar
  17. 17.
    Espinace, P., Kollar, T., Roy, N., Soto, A.: Indoor scene recognition by a mobile robot through adaptive object detection. Robot. Auton. Syst. 61(9), 932–947 (2013)CrossRefGoogle Scholar
  18. 18.
    Fredembach, C., Schroder, M., Susstrunk, S.: Eigenregions for image classification. IEEE Trans. Pattern Anal. Mach. Intell. 26(12), 1645–1649 (2004)CrossRefGoogle Scholar
  19. 19.
    Mojsilovic, A., Gomes, J., Rogowitz, B.E.: ISEE: perceptual features for image library navigation. In: SPIE Human Vision and Electronic Imaging (2002)Google Scholar
  20. 20.
    Kollar, T., Roy, N.: Utilizing object–object and object–scene context when planning to find things. In: International Conference on Robotics and Automation (2009)Google Scholar
  21. 21.
    Dollar, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: British Machine Vision Conference (2009)Google Scholar
  22. 22.
    Vasudevan, S., Siegwart, R.: Bayesian space conceptualization and place classification for semantic maps in mobile robotics. Robot. Auton. Syst. 56, 522–537 (2008)CrossRefGoogle Scholar
  23. 23.
    Espinace, P., Kollar, T., Soto, A., Roy, N.: Indoor scene recognition through object detection. In: IEEE International Conference on Robotics and Automation (2010)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. 1.School of Computer Science and EngineeringLovely Professional UniversityPhagwara, JalandharIndia

Personalised recommendations