Sparsity-regularized feature selection for multi-class remote sensing image classification

  • Tao Chen
  • Ye Zhao
  • Yanrong GuoEmail author
Multi-Source Data Understanding (MSDU)


Remote sensing image classification plays an important role in a wide range of applications and has caused widely concerns. During the last few years, great efforts have been made to develop a number of scene classification methods for remote sensing images. However, the existing remote sensing image classification methods do not perform satisfactorily in dealing with multi-class classification problems and rely heavily on the quality of data sets. These disadvantages seriously restrict the application of remote sensing image, including industrial research, analysis and calculation of land use and land coverage. To this end, this paper proposes a remote sensing image classification algorithm based on the sparse regularized feature learning method. Specifically, after constructing bag of features by using speeded up robust features extraction algorithm, direct sparsity optimization-based feature selection method is applied for selecting discriminative features, which is used for constructing support vector machine classifier model. The proposed algorithm has been evaluated and compared with other advanced feature selection methods on four public remote sensing image data sets. The experimental results demonstrate the effectiveness of our proposed image classification algorithm, which has been successfully applied to remote sensing image classification tasks.


Feature extraction Bag of features Sparsity-regularized feature selection Remote sensing image classification 



The research was supported by the National Key R&D Program of China under Grant No. 2017YFC0820604, Anhui Provincial Natural Science Foundation under Grant No. 1808085QF188, and National Nature Science Foundation of China under grant Nos. 61702156, 61502138, 61772171 and 61876056.

Compliance with ethical standards

Conflict of interest

The authors declare that they have no conflict of interest.


  1. 1.
    Castelluccio M et al (2015) Land use classification in remote sensing images by convolutional neural networks. arXiv:1508.00092
  2. 2.
    Yang J et al (2007) Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of the international workshop on multimedia information retrieval. ACM, 2007Google Scholar
  3. 3.
    Tuia D et al (2009) Active learning methods for remote sensing image classification. IEEE Trans Geosci Remote Sens 47(7):2218–2232CrossRefGoogle Scholar
  4. 4.
    Zhang L et al (2012) On combining multiple features for hyperspectral remote sensing image classification. IEEE Trans Geosci Remote Sens 50(3):879–893CrossRefGoogle Scholar
  5. 5.
    Zhu X, Li X, Zhang S (2016) Block-row sparse multiview multilabel learning for image classification. IEEE Trans Cybern 46(2):450–461CrossRefGoogle Scholar
  6. 6.
    Zhu X et al (2017) Robust joint graph sparse coding for unsupervised spectral feature selection. IEEE Trans Neural Netw Learn Syst 28(6):1263–1275MathSciNetCrossRefGoogle Scholar
  7. 7.
    Nowak E, Jurie F, Triggs B (2006) Sampling strategies for bag-of-features image classification. In: European conference on computer vision. Springer, Berlin, Heidelberg, 2006Google Scholar
  8. 8.
    Peng H, Fan Y (2017) A general framework for sparsity regularized feature selection via iteratively reweighted least square minimization. In: AAAI 2017, pp 2471–2477Google Scholar
  9. 9.
    Wang J et al (2013) Bag-of-words representation for biomedical time series classification. Biomed Signal Process Control 8(6):634–644CrossRefGoogle Scholar
  10. 10.
    Wallach HM (2006) Topic modeling: beyond bag-of-words. In: Proceedings of the 23rd international conference on machine learning. ACM, 2006Google Scholar
  11. 11.
    Peng H, Fan Y (2016) Direct sparsity optimization based feature selection for multi-class classification.  In: IJCAI 2016, pp 1918–1924Google Scholar
  12. 12.
    Bay H, Tuytelaars T, Van Gool L (2006) Surf: speeded up robust features. In: European conference on computer vision. Springer, Berlin, HeidelbergGoogle Scholar
  13. 13.
    Bay H et al (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110(3):346–359CrossRefGoogle Scholar
  14. 14.
    Jégou H, Douze M, Schmid C (2010) Improving bag-of-features for large scale image search. Int J Comput Vis 87(3):316–336CrossRefGoogle Scholar
  15. 15.
    Lebanon G, Mao Y, Dillon J (2007) The locally weighted bag of words framework for document representation. J Mach Learn Res 8(Oct):2405–2441MathSciNetzbMATHGoogle Scholar
  16. 16.
    Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3(Jan):993–1022zbMATHGoogle Scholar
  17. 17.
    Fei-Fei L, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: IEEE computer society conference on computer vision and pattern recognition, 2005. CVPR 2005. IEEE, vol 2Google Scholar
  18. 18.
    Niebles JC, Wang H, Fei-Fei L (2008) Unsupervised learning of human action categories using spatial-temporal words. Int J Comput Vis 79(3):299–318CrossRefGoogle Scholar
  19. 19.
    Anthimopoulos M et al (2014) A food recognition system for diabetic patients based on an optimized bag-of-features model. IEEE J Biomed Health Inform 18(4):1261–1271CrossRefGoogle Scholar
  20. 20.
    Zhang H et al (2013) Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval. In: Proceedings of the 21st ACM international conference on multimedia. ACM, 2013Google Scholar
  21. 21.
    Li J et al (2017) Feature selection: a data perspective. ACM Comput Surv (CSUR) 50(6):94CrossRefGoogle Scholar
  22. 22.
    Zhang H et al (2014) Robust (semi) nonnegative graph embedding. IEEE Trans Image Process 23(7):2996–3012MathSciNetCrossRefzbMATHGoogle Scholar
  23. 23.
    Zhang H et al (2017) Visual translation embedding network for visual relation detection. In: CVPR 2017, pp 3107–3115Google Scholar
  24. 24.
    Nie F et al (2010) Efficient and robust feature selection via joint ℓ2, 1-norms minimization. In: NIPS 2010, pp 1813–1821Google Scholar
  25. 25.
    Chartrand R, Staneva V (2008) Restricted isometry properties and nonconvex compressive sensing. Inverse Prob 24(3):035020MathSciNetCrossRefzbMATHGoogle Scholar
  26. 26.
    Zhang M et al (2014) Feature selection at the discrete limit. In: AAAI 2014, pp 1355–1361Google Scholar
  27. 27.
    Lu C, Lin Z, Yan S (2015) Smoothed low rank and sparse matrix recovery by iteratively reweighted least squares minimization. IEEE Trans Image Process 24(2):646–654MathSciNetCrossRefGoogle Scholar
  28. 28.
    Xia G et al (2017) AID: a benchmark data set for performance evaluation of aerial scene classification. IEEE Trans Geosci Remote Sens 55(7):3965–3981CrossRefGoogle Scholar
  29. 29.
    Cheng G, Han J, Xiaoqiang L (2017) Remote sensing image scene classification: benchmark and state of the art. Proc IEEE 105(10):1865–1883CrossRefGoogle Scholar
  30. 30.
    Li H et al (2017) RSI-CB: a large scale remote sensing image classification benchmark via crowdsource data. arXiv:1705.10450
  31. 31.
    Anthimopoulos M et al (2014) A food recognition system for diabetic patients based on an optimized bag-of-features model. IEEE J Biomed Health Inform 18(4):1261–1271CrossRefGoogle Scholar
  32. 32.
    Pohjalainen J, Räsänen O, Kadioglu S (2015) Feature selection methods and their combinations in high-dimensional classification of speaker likability, intelligibility and personality traits. Comput Speech Lang 29(1):145–171CrossRefGoogle Scholar
  33. 33.
    Peng H, Long F, Ding C (2005) Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell 27(8):1226–1238CrossRefGoogle Scholar

Copyright information

© Springer-Verlag London Ltd., part of Springer Nature 2019

Authors and Affiliations

  1. 1.School of Computer and InformationHefei University of TechnologyHefeiChina

Personalised recommendations