Exploiting label consistency in structured sparse representation for classification

Original Article
  • 22 Downloads

Abstract

Sparse representation with adaptive dictionaries has emerged as a promising tool in computer vision and pattern analysis. While standard sparsity promoted by \(\ell _0\) or \(\ell _1\) regularization has been widely used, recent approaches seek for kinds of structured sparsity to improve the discriminability of sparse codes. For classification, label consistency is one useful concept regarding structured sparsity, which relates class labels to dictionary atoms for generating discriminative sparsity patterns. Motivated by the limitations of existing label-consistent regularization methods, in this paper, we investigate the exploitation of label consistency and propose an effective sparse coding approach. The proposed approach enforces the sparse approximation of a label consistency matrix by sparse code during dictionary learning, which encourages the supports of sparse codes to be consistent for intra-class signals and distinct for inter-class signals. Thus, the learned dictionary can induce discriminative sparsity patterns when used in sparse coding. Moreover, the proposed method is computationally efficient, as the label consistency regularization developed in our method brings very little additional computational cost in solving the related sparse coding problem. The effectiveness of the proposed method is demonstrated with several recognition tasks, and the experimental results show that our method is very competitive with some state-of-the-art approaches.

Keywords

Sparse coding Label consistency Structured sparsity Image classification 

Notes

Acknowledgements

Yuhui Quan would like to thank the support by National Natural Science Foundation of China (61602184), Science and Technology Planning Project of Guangdong Province (2017A030313376), Science and Technology Program of Guangzhou (201707010147) and Educational Reform Project of South China University of Technology (j2jwY9160960). Yong Xu would like to thank the support by National Natural Science Foundation of China (U16114616167224161602184 and 61528204) and Cultivation Project of Major Basic Research of NSF-Guangdong Province (2016A030308013).

Compliance with ethical standards

Conflict of interest

The authors declare that they have no conflict of interest.

References

  1. 1.
    Aharon M, Elad M, Bruckstein A (2006) K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. Trans Signal Process 54(11):4311–4322CrossRefMATHGoogle Scholar
  2. 2.
    Bagheri MA, Gao Q, Escalera S, Moeslund TB, Ren H, Etemad E (2017) Locality regularized group sparse coding for action recognition. Comput Vis Image Underst 158:106–114CrossRefGoogle Scholar
  3. 3.
    Bao C, Ji H, Quan Y, Shen Z (2016) Dictionary learning for sparse coding: algorithms and convergence analysis. Trans Pattern Anal Mach Intell 38(7):1356–1369CrossRefGoogle Scholar
  4. 4.
    Boureau YL, Bach F, LeCun Y, Ponce J (2010) Learning mid-level features for recognition. In: Conference on computer vision and pattern recognition, IEEE, pp 2559–2566Google Scholar
  5. 5.
    Cai S, Zuo W, Zhang L, Feng X, Wang P (2014) Support vector guided dictionary learning. In: ECCV, Springer, pp 624–639Google Scholar
  6. 6.
    Chi YT, Ali M, Rajwade A, Ho J (2013) Block and group regularized sparse modeling for dictionary learning. In: Conference on computer vision and pattern recognition, IEEE, pp 377–382Google Scholar
  7. 7.
    Gao Q, Wang Q, Huang Y, Gao X, Hong X, Zhang H (2015) Dimensionality reduction by integrating sparse representation and fisher criterion and its applications. Trans Image Process 24(12):5684–5695MathSciNetCrossRefGoogle Scholar
  8. 8.
    Gao S, Tsang IW, Chia LT, Zhao P (2010) Local features are not lonely–Laplacian sparse coding for image classification. In: Conference on computer vision and pattern recognition, IEEE, pp 3555–3561Google Scholar
  9. 9.
    Gao Y, Ma J, Yuille AL (2017) Semi-supervised sparse representation based classification for face recognition with insufficient labeled samples. Trans Image Process 26(5):2545–2560MathSciNetCrossRefGoogle Scholar
  10. 10.
    Ghanem B, Ahuja N (2010) Maximum margin distance learning for dynamic texture recognition. In: European conference on computer vision, Springer, pp 223–236Google Scholar
  11. 11.
    Ghanem B, Ahuja N (2010) Maximum margin distance learning for dynamic texture recognition. In: European conference on computer vision, Springer, pp 223–236Google Scholar
  12. 12.
    Huang K, Aviyente S (2006) Sparse representation for signal classification. In: Advances in neural information processing systems, pp 609–616Google Scholar
  13. 13.
    Jacob L, Obozinski G, Vert JP (2009) Group lasso with overlap and graph lasso. In: International conference on machine learning, ACM, pp 433–440Google Scholar
  14. 14.
    Jenatton R, Audibert JY, Bach F (2011) Structured variable selection with sparsity-inducing norms. J Mach Learn Res 12:2777–2824MathSciNetMATHGoogle Scholar
  15. 15.
    Jiang Z, Lin Z, Davis LS (2011) Learning a discriminative dictionary for sparse coding via label consistent K-SVD. In: Conference on computer vision and pattern recognition, IEEE, pp 1697–1704Google Scholar
  16. 16.
    Jiang Z, Zhang G, Davis LS (2012) Submodular dictionary learning for sparse coding. In: Proceedings of IEEE conference on computer vision and pattern recognition, IEEE, pp 3418–3425Google Scholar
  17. 17.
    Jiang Z, Lin Z, Davis L (2013) Label consistent K-SVD: learning a discriminative dictionary for recognition. Trans Pattern Anal Mach Intell 35(11):2651–2664CrossRefGoogle Scholar
  18. 18.
    Kim S, Xing EP (2010) Tree-guided group lasso for multi-task regression with structured sparsity. In: International conference on machine learning, pp 543–550Google Scholar
  19. 19.
    Kong S, Wang D (2012) A dictionary learning approach for classification: separating the particularity and the commonality. In: ECCV, Springer, pp 186–199Google Scholar
  20. 20.
    Kviatkovsky I, Gabel M, Rivlin E, Shimshoni I (2017) On the equivalence of the LC-KSVD and the D-KSVD algorithms. Trans Pattern Anal Mach Intell 39(2):411–416CrossRefGoogle Scholar
  21. 21.
    Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Conference on computer vision and pattern recognition, vol 2. IEEE, pp 2169–2178Google Scholar
  22. 22.
    Lian XC, Li Z, Lu BL, Zhang L (2010) Max-margin dictionary learning for multiclass image categorization. In: European conference on computer vision, Springer, pp 157–170Google Scholar
  23. 23.
    Lian XC, Li Z, Wang C, Lu BL, Zhang L (2010) Probabilistic models for supervised dictionary learning. In: Conference on computer vision and pattern recognition, IEEE, pp 2305–2312Google Scholar
  24. 24.
    Liu J, Ye J (2010) Moreau–Yosida regularization for grouped tree structure learning. Adv Neural Inf Process Syst 23:1459–1467Google Scholar
  25. 25.
    Lu J, Wang G, Deng W, Moulin P (2014) Simultaneous feature and dictionary learning for image set based face recognition. In: European conference on computer vision, Springer, pp 265–280Google Scholar
  26. 26.
    Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A (2008) Discriminative learned dictionaries for local image analysis. In: Conference on computer vision and pattern recognition, IEEE, pp 1–8Google Scholar
  27. 27.
    Mairal J, Ponce J, Sapiro G, Zisserman A, Bach FR (2009) Supervised dictionary learning. In: Advances in neural information processing systems, pp 1033–1040Google Scholar
  28. 28.
    Majumdar A (2015) Discriminative label consistent dictionary learning. In: 2015 IEEE international conference on image processing (ICIP), IEEE, pp 1016–1020Google Scholar
  29. 29.
    Marsousi M, Li X, Plataniotis KN (2016) Shape-included label-consistent discriminative dictionary learning: an approach to detect and segment multi-class objects in images. In: International conference on image processing, IEEE, pp 729–733Google Scholar
  30. 30.
    Martinez AM (1998) The AR face database. CVC Technical report 24Google Scholar
  31. 31.
    Péteri R, Fazekas S, Huiskes MJ (2010) DynTex : a comprehensive database of dynamic textures. Pattern Recogn Lett 31:1627–1632CrossRefGoogle Scholar
  32. 32.
    Pham DS, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: Conference on computer vision and pattern recognition, IEEE, pp 1–8Google Scholar
  33. 33.
    Quan Y, Ji H, Shen Z (2014) Data-driven multi-scale non-local wavelet frame construction and image recovery. J Sci Comput 63:307–329MathSciNetCrossRefMATHGoogle Scholar
  34. 34.
    Quan Y, Xu Y, Sun Y, Huang Y (2016) Supervised dictionary learning with multiple classifier integration. Pattern Recogn 55:247–260CrossRefGoogle Scholar
  35. 35.
    Quan Y, Xu Y, Sun Y, Huang Y, Ji H (2016) Sparse coding for classification via discrimination ensemble. In: Conference on computer vision and pattern recognition, pp 5839–5847Google Scholar
  36. 36.
    Ramirez I, Sprechmann P, Sapiro G (2010) Classification and clustering via dictionary learning with structured incoherence and shared features. In: Conference on computer vision and pattern recognition, IEEE, pp 3501–3508Google Scholar
  37. 37.
    Rosenblum K, Zelnik-Manor L, Eldar Y (2010) Dictionary optimization for block-sparse representations. In: Association for the advancement of artificial intelligence fall symposium: manifold learning and its applications, pp 50–58Google Scholar
  38. 38.
    Sprechmann P, Ramirez I, Sapiro G, Eldar YC (2011) C-HiLasso: a collaborative hierarchical sparse modeling framework. Trans Signal Process 59(9):4183–4198MathSciNetCrossRefGoogle Scholar
  39. 39.
    Sun Y, Liu Q, Tang J, Tao D (2014) Learning discriminative dictionary for group sparse representation. Trans Image Process 23(9):3816–3828MathSciNetCrossRefMATHGoogle Scholar
  40. 40.
    Wang J, Yang J, Yu K, Lv F, Huang T, Gong Y (2010) Locality-constrained linear coding for image classification. In: Conference on computer vision and pattern recognition, IEEE, pp 3360–3367Google Scholar
  41. 41.
    Wang X, Yang M, Shen L (2016) Structured regularized robust coding for face recognition. Neurocomputing 216:18–27CrossRefGoogle Scholar
  42. 42.
    Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. Trans Pattern Anal Mach Intell 31(2):210–227CrossRefGoogle Scholar
  43. 43.
    Xu Y, Quan Y, Ling H, Ji H (2011) Dynamic texture classification using dynamic fractal analysis. In: International conference on computer vision, IEEE, pp 1219–1226Google Scholar
  44. 44.
    Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: Conference on computer vision and pattern recognition, IEEE, pp 1794–1801Google Scholar
  45. 45.
    Yang J, Yu K, Huang T (2010) Supervised translation-invariant sparse coding. In: Conference on computer vision and pattern recognition, IEEE, pp 3517–3524Google Scholar
  46. 46.
    Yang J, Zhang L, Xu Y, Jy Yang (2012) Beyond sparsity: the role of L1-optimizer in pattern classification. Pattern Recogn 45(3):1104–1118CrossRefMATHGoogle Scholar
  47. 47.
    Yang M, Zhang D, Feng X (2011) Fisher discrimination dictionary learning for sparse representation. In: International conference on computer vision, IEEE, pp 543–550Google Scholar
  48. 48.
    Yang M, Dai D, Shen L, Van Gool L (2014) Latent dictionary learning for sparse representation based classification. In: Conference on computer vision and pattern recognition, pp 4138–4145Google Scholar
  49. 49.
    Yang M, Zhang L, Feng X, Zhang D (2014) Sparse representation based fisher discrimination dictionary learning for image classification. Int J Comput Vision 109(3):209–232MathSciNetCrossRefMATHGoogle Scholar
  50. 50.
    Zhang D, Liu P, Zhang K, Zhang H, Wang Q, Jing X (2015) Class relatedness oriented-discriminative dictionary learning for multiclass image classification. Pattern Recognit 59:168–175CrossRefGoogle Scholar
  51. 51.
    Zhang H, Wu QJ, Chow TW, Zhao M (2012) A two-dimensional neighborhood preserving projection for appearance-based face recognition. Pattern Recogn 45(5):1866–1876CrossRefMATHGoogle Scholar
  52. 52.
    Zhang Q, Li B (2010) Discriminative K-SVD for dictionary learning in face recognition. In: Conference on computer vision and pattern recognition, IEEE, pp 2691–2698Google Scholar
  53. 53.
    Zhao G, Pietikainen M (2007) Dynamic texture recognition using local binary patterns with an application to facial expressions. Trans Pattern Anal Mach Intell 29(6):915–928CrossRefGoogle Scholar
  54. 54.
    Zhou N, Shen Y, Peng J, Fan J (2012) Learning inter-related visual dictionary for object recognition. In: Conference on computer vision and pattern recognition, IEEE, pp 3490–3497Google Scholar

Copyright information

© The Natural Computing Applications Forum 2018

Authors and Affiliations

  1. 1.School of Computer Science and EngineeringSouth China University of TechnologyGuangzhouChina

Personalised recommendations