Multimedia Tools and Applications

, Volume 77, Issue 20, pp 26219–26238 | Cite as

Vision-based entrance detection in outdoor scenes

  • Mehdi Talebi
  • Abbas VafaeiEmail author
  • Amirhassan Monadjemi


Doors are a significant object for the visually impaired and robots to enter and exit buildings. Although the accuracy of door detection is reported high in indoor scenes, it has become a difficult problem in outdoor scenes in computer vision. The reason may lie in the fact that such properties of a simple ordinary door such as handles, corners, and the gap between the door and the ground may not be visible due to the great variety of doors in outdoor environments. In this paper, we present a vision-based method for detecting building entrances in outdoor images. After extracting the lines and deleting the extra ones, regions between the vertical lines are specified and the features including height, width, location, color, texture and the number of lines inside the regions are obtained. Finally, some additional knowledge such as door existence at the bottom of the image, a reasonable height and width of a door, the difference between color and texture of the doors and those of the neighboring regions, and numerous lines on doors is used to decide on door detection. The method was tested on the eTRIMS dataset, door images from the ImageNet dataset, and our own dataset including doors of houses, apartments, and stores leading to acceptable results. The obtained results show that our approach outperforms comparable state-of-the-art approaches.


Entrance detection Lines extraction Color Texture Image processing 


  1. 1.
    Anguelov D, Koller D, Parker E, Thrun S (2004) Detecting and modeling doors with mobile robots. In: Proceedings of the IEEE International Conference on Robotics and Automation, New Orleans, pp. 3777–3784Google Scholar
  2. 2.
    Chen Z, Li Y, Birchfield ST (2011) Visual detection of lintel-occluded doors by integrating multiple cues using data-driven Markov chain Monte Carlo process. Robot Auton Syst 59(11):966–976Google Scholar
  3. 3.
    Cohen A, Schwing AG, Pollefeys M (2014) Efficient structured parsing of facades using dynamic programming. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, pp. 3206–3213Google Scholar
  4. 4.
    Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 886–893Google Scholar
  5. 5.
    Dollar P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell (TPAMI) 36(8):1532–1545CrossRefGoogle Scholar
  6. 6.
    Garcia-Garcia A, Orts-Escolano S, Oprea SO, Villena-Martinez V, Garcia-Rodriguez J (2017) A review on deep learning techniques applied to semantic segmentation. arXiv 1704.06857Google Scholar
  7. 7.
    Girshick R (2015) Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 1440–1448Google Scholar
  8. 8.
    Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580–587Google Scholar
  9. 9.
    Gould S, He X (2014) Scene understanding by labeling pixels. Commun ACM 57(11):68–77CrossRefGoogle Scholar
  10. 10.
    Grompone von Gioi R, Jakubowicz J, Morel JM, Randall G (2010) LSD: A fast line segment detector with a false detection control. IEEE Trans Pattern Anal Mach Intell (TPAMI) 32(4):722–732CrossRefGoogle Scholar
  11. 11.
    Grompone von Gioi R, Jakubowicz J, Morel JM, Randall G (2012) LSD: A line segment detector. Image Processing on Line 2:35–55CrossRefGoogle Scholar
  12. 12.
    He Z, You X, Yuan Y (2009) Texture image retrieval based on non-tensor product wavelet filter banks. Signal Process 89(8):1501–1510CrossRefGoogle Scholar
  13. 13.
    Hensler J, Blaich M, Bittel O (2009) Real-time door detection based on AdaBoost learning algorithm. In: Proceedings of the International Conference on Research and Education in Robotics, La Ferte-Bernard, pp. 61–73Google Scholar
  14. 14.
    Hoiem D, Efros AA, Hebert M (2007) Recovering surface layout from an image. Int J Comput Vis (IJCV) 75(1):151–172CrossRefGoogle Scholar
  15. 15.
    Jain AK, Farrokhnia F (1991) Unsupervised texture segmentation using Gabor filters. Pattern Recogn 24(12):1167–1186CrossRefGoogle Scholar
  16. 16.
    Kang SJ, Trinh HH, Kim DN, Jo KH (2010) Entrance detection of buildings using multiple cues. In: Proceedings of the International Conference on Intelligent Information and Database Systems, Hue, pp. 251–260Google Scholar
  17. 17.
    Korc F, Forstner W (2009) eTRIMS image database for interpreting images of man-made scenes. Technical Report, University of BonnGoogle Scholar
  18. 18.
    Leung T, Malik J (2001) Representing and recognizing the visual appearance of materials using three-dimensional textons. Int J Comput Vis (IJCV) 43(1):29–44CrossRefGoogle Scholar
  19. 19.
    Liu J, Korah T, Hedau V, Parameswaran V, Grzeszczuk R, Liu Y (2014) Entrance detection from street-view images. In: IEEE International Conference on Computer Vision and Pattern Recognition Workshop (CVPR), ColumbusGoogle Scholar
  20. 20.
    Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis (IJCV) 60(2):91–110CrossRefGoogle Scholar
  21. 21.
    Mathias M, Martinovic A, Van Gool L (2016) ATLAS: A three-layered approach to facade parsing. Int J Comput Vis (IJCV) 118(1):22–48MathSciNetCrossRefGoogle Scholar
  22. 22.
    Murillo AC, Kosecka J, Guerrero JJ, Sagues C (2008) Visual door detection integrating appearance and shape cues. Robot Auton Syst 56(6):512–521CrossRefGoogle Scholar
  23. 23.
    Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: Towards real-time object detection with region proposal networks. In: Proceedings of the International Conference on Neural Information Processing Systems (NIPS), pp. 91–99Google Scholar
  24. 24.
    Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, Fei-Fei L (2015) ImageNet large scale visual recognition challenge. Int J Comput Vis (IJCV) 115(3):211–252MathSciNetCrossRefGoogle Scholar
  25. 25.
    Sekkal R, Pasteau F, Babel M, Brun B, Leplumey I (2013) Simple monocular door detection and tracking. In: Proceedings of the IEEE International Conference on Image Processing, Melbourne, pp. 3929–3933Google Scholar
  26. 26.
    Shuai B, Zuo Z, Wang G, Wang B (2016) Scene parsing with integration of parametric and non-parametric models. IEEE Trans Image Process 25(5):2379–2391MathSciNetCrossRefGoogle Scholar
  27. 27.
    Szeliski R (2011) Computer vision: Algorithms and applications. Springer, LondonCrossRefGoogle Scholar
  28. 28.
    Teboul O (2010) Ecole Centrale Paris Facades Database. Available:
  29. 29.
    Teboul O, Kokkinos I, Simon L, Koutsourakis P, Paragios N (2013) Parsing facades with shape grammars and reinforcement learning. IEEE Trans Pattern Anal Mach Intell (TPAMI) 35(7):1744–1756CrossRefGoogle Scholar
  30. 30.
    Tighe J, Niethammer M, Lazebnik S (2015) Scene parsing with object instance inference using regions and per-exemplar detectors. Int J Comput Vis (IJCV) 112(2):150–171MathSciNetCrossRefGoogle Scholar
  31. 31.
    Yang MY, Förstner W (2011) Regionwise classification of building facade images. In: Proceedings of the ISPRS Conference on Photogrammetric Image Analysis, Munich, pp. 209–220Google Scholar
  32. 32.
    Zhang D, You X, Wang P, Yanushkevich SN, Tang YY (2009) Facial biometrics using non-tensor product wavelet and 2D discriminant techniques. Int J Pattern Recognit Artif Intell 23(3):1–21CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Faculty of Computer EngineeringUniversity of IsfahanIsfahanIran

Personalised recommendations