Multimedia Tools and Applications

, Volume 77, Issue 3, pp 3339–3351 | Cite as

Building detection from orthophotos using binary feature classification

  • Yan Hu
  • Xiangyun Hu
  • Penglong Li
  • Yi Ding


Building detection in orthophotos is crucial for various applications, such as urban planning and real-estate management. In order to realize accurate and fast building detection, a non-interactive approach based on binary feature classification is brought forward in this paper. The proposed approach includes two major stages, i.e., building area detection and building contours extraction. In the first stage, a sequence of intersections is obtained by superpixel segmentation in the subsampled orthophoto, and then building area is reserved roughly according to the classification of intersections. In the second stage, the sequence of intersections is updated by superpixel segmentation in the building area from original orthophoto, and then building contours is extracted in accordance with the classification of intersections likewise. The local feature of the intersections is descripted employing our extremely compact binary descriptor, and is classified using binary bag-of-features. Experiments show that benefiting from binary description and making full use of texture details and color channels, the proposed descriptor is not only computationally frugal, but also accurate. Experiments are also conducted on orthophotos with different roof colors, textures, shapes, sizes and orientations, and demonstrate that the proposed approach are capable of achieving desirable results.


Building detection Machine learning Local feature Descriptor Classifier 



This research was supported by the Key Laboratory for Earth Observation, National Administration of Surveying, Mapping and Geoinformation of China (K2015009).


  1. 1.
    Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Susstrunk S (2012) SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans Pattern Anal Mach Intell 34(11):2274–2282CrossRefGoogle Scholar
  2. 2.
    Alahi A, Ortiz R, Vandergheynst P (2012) FREAK: fast retina keypoint. In: IEEE Conference on computer vision and pattern recognition, Providence, USA, 16-21 Jun, pp 510–517Google Scholar
  3. 3.
    Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110(3):346–359CrossRefGoogle Scholar
  4. 4.
    Bruzzone L, Demir B (2014) A review of modern approaches to classification of remote sensing data. In: Manakos I, Braun M (eds) Land use and land cover mapping in Europe: practices and trends. Springer, Dordrecht, pp 127–143CrossRefGoogle Scholar
  5. 5.
    Calonder M, Lepetit V, Ozuysal M, Trzcinski T, Strecha C, Fua P (2012) BRIEF: computing a local binary descriptor very fast. IEEE Trans Pattern Anal Mach Intell 34(7):1281–1298CrossRefGoogle Scholar
  6. 6.
    Chang X, Nie F, Yang Y, Huang H (2014) A convex formulation for semi-supervised multi-label feature selection. In: AAAI Conference on artificial intelligence, Quebec City, Canada, 27-31 Jul, pp 1171–1177Google Scholar
  7. 7.
    Chang X, Nie F, Wang S, Yang Y, Zhou X, Zhang C (2016) Compound rank-k projections for bilinear analysis. IEEE Trans Neural Netw Learn Syst 27(7):1502–1513MathSciNetCrossRefGoogle Scholar
  8. 8.
    Chang X, Ma Z, Yang Y, Zeng Z, Hauptmann AG (2017) Bi-level semantic representation analysis for multimedia event detection. IEEE Trans Cybern 47(5):1180–1197CrossRefGoogle Scholar
  9. 9.
    Chang X, Yu YL, Yang Y, Xing EP (2017) Semantic pooling for complex event analysis in untrimmed videos. IEEE Trans Pattern Anal Mach Intell 39(8):1617–1632CrossRefGoogle Scholar
  10. 10.
    Chang X, Ma Z, Lin M, Yang Y, Hauptmann AG (2017) Feature interaction augmented sparse learning for fast Kinect motion detection. IEEE Trans Image Process 26(8):3911–3920MathSciNetCrossRefGoogle Scholar
  11. 11.
    Dornaika F, Moujahid A, Merabet M, Ruichek Y (2016) Building detection from orthophotos using a machine learning approach: an empirical study on image segmentation and descriptors. Expert Syst Appl 58:130–142CrossRefGoogle Scholar
  12. 12.
    Ghamisi P, Dalla M, Benediktsson JA (2015) A survey on spectral spatial classification techniques based on attribute proles. IEEE Trans Geosci Remote Sens 53(5):2335–2353CrossRefGoogle Scholar
  13. 13.
    Hu MK (1962) Visual pattern recognition by moment invariants. IRE Trans Inf Theory 8(2):179–187CrossRefMATHGoogle Scholar
  14. 14.
    Khurana M, Wadhwa V (2015) Automatic building detection using modified grab cut algorithm from high resolution satellite image. Int J Adv Res Comput Commun Eng 4(8):158–164Google Scholar
  15. 15.
    Lecun Y, Bengio Y, Hinton GE (2015) Deep learning. Nature 521:436–444CrossRefGoogle Scholar
  16. 16.
    Leutenegger S, Chli M, Siegwart RY (2011) BRISK: binary robust invariant scalable keypoints. In: IEEE International conference on computer vision, Barcelona, Spain, 6-13 Nov, pp 2548–2555Google Scholar
  17. 17.
    Li Z, Gong W, Nee AYC, Ong SK (2009) Region-restricted rapid keypoint registration. Opt Express 17(24):22096–22101CrossRefGoogle Scholar
  18. 18.
    Li Z, Gong W, Nee AYC, Ong SK (2009) The effectiveness of detector combinations. Opt Express 17(9):7407–7418CrossRefGoogle Scholar
  19. 19.
    Liu L, Wiliem A, Chen S, Lovell BC (2014) Automatic image attribute selection for zero-shot learning of object categories. In: International conference on pattern recognition, Stockholm, Sweden, 24-28 Aug, pp 2619–2624Google Scholar
  20. 20.
    Liu L, Nie F, Zhang T, Wiliem A, Lovell BC (2016) Unsupervised automatic attribute discovery method via multi-graph clustering. In: International conference on pattern recognition, Cancun, Mexico, 4-8 Dec, pp 1713–1718Google Scholar
  21. 21.
    Liu L, Wiliem A, Chen S, Lovell BC (2017) What is the best way for extracting meaningful attributes from pictures? Pattern Recogn 64:314–326CrossRefGoogle Scholar
  22. 22.
    Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110CrossRefGoogle Scholar
  23. 23.
    Ojala T, Pietikainen M, Harwood D (1996) Comparative study of texture measures with classification based on feature distributions. Pattern Recogn 29(1):51–59CrossRefGoogle Scholar
  24. 24.
    Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987CrossRefMATHGoogle Scholar
  25. 25.
    Passino KM (2002) Biomimicry of bacterial foraging for distributed optimization and control. IEEE Control Syst Mag 22(3):52–67CrossRefGoogle Scholar
  26. 26.
    Rosin PL (1999) Measuring corner properties. Comput Vis Image Underst 73(2):291–307CrossRefGoogle Scholar
  27. 27.
    Rother C, Kolmogorov V, Blake A (2004) Grabcut: interactive foreground extraction using iterated graph cuts. ACM Trans Graph 23(3):309–314CrossRefGoogle Scholar
  28. 28.
    Rublee E, Rabaud V, Konolige K, Bradski G (2011) ORB: an efficient alternative to SIFT or SURF. In: IEEE International conference on computer vision, Barcelona, Spain, 6-13 Nov, pp 2564–2571Google Scholar
  29. 29.
    Sivic J, Zisserman A (2009) Efficient visual search of videos cast as text retrieval. IEEE Trans Pattern Anal Mach Intell 31(4):591–606CrossRefGoogle Scholar
  30. 30.
    Vakalopoulou M, Karantzalos K, Komodakis N, Paragios N (2015) Building detection in very high resolution multispectral data with deep learning features. In: IEEE International conference on geoscience and remote sensing symposium, Milan, Italy, 26-31 Jul, pp 1873–1876Google Scholar
  31. 31.
    Volpi M, Tuia D (2017) Dense semantic labeling of subdecimeter resolution images with convolutional neural networks. IEEE Trans Geosci Remote Sens 55(2):881–893CrossRefGoogle Scholar
  32. 32.
    Yang F, Lu H, Zhang W, Yang G (2012) Visual tracking via bag of features. IET Image Process 6(2):115–128MathSciNetCrossRefGoogle Scholar
  33. 33.
    Zhang T, Liu L, Wiliem A, Lovell B (2016) Is Alice chasing or being chased? : determining subject and object of activities in videos. In: IEEE Winter conference on applications of computer vision, Lake Placid, USA, 7-9 Mar, pp 1–7Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2017

Authors and Affiliations

  1. 1.School of Remote Sensing and Information EngineeringWuhan UniversityWuhanChina
  2. 2.Chongqing Geomatics CenterChongqingChina

Personalised recommendations