Skip to main content

Discriminant Manifold Learning via Sparse Coding for Image Analysis

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9517))

Abstract

Traditional subspace learning methods directly calculate the statistical properties of the original input images, while ignoring different contributions of different image components. In fact, the noise (e.g., illumination, shadow) in the image often has a negative influence on learning the desired subspace and should have little contribution to image recognition. To tackle this problem, we propose a novel subspace learning method named Discriminant Manifold Learning via Sparse Coding (DML_SC). In our method, we first decompose the input image into several components via dictionary learning, and then regroup the components into a More Important Part (MIP) and a Less Important Part (LIP). The MIP can be regarded as the clean part of the original image residing on a nonlinear submanifold, while LIP as noise in the image. Finally, the MIP and LIP are incorporated into manifold learning to learn a desired discriminative subspace. The proposed method is able to deal with data with and without labels, yielding supervised and unsupervised DML SCs. Experimental results show that DML_SC achieves best performance on image recognition and clustering tasks compared with well-known subspace learning and sparse representation methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Turk, M., Pentland, A. P.: Face recognition using eigenfaces. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 586–591 (1991)

    Google Scholar 

  2. Nikitidis, S., et al.: Maximum margin projection subspace learning for visual data analysis. IEEE Trans. Image Process. (TIP) 23(10), 4413–4425 (2014)

    Article  MathSciNet  Google Scholar 

  3. Jin, W., Liu, R., et al.: Robust visual tracking using latent subspace projection pursuit. In: IEEE International Conference on Multimedia and Expo (ICME) (2014)

    Google Scholar 

  4. Jiang, X.: Linear subspace learning-based dimensionality reduction. IEEE Signal Process. Mag. 28(2), 16–26 (2011)

    Article  Google Scholar 

  5. Huang, Z., Wang, R., Shan, S., Chen X.: Projection metric learning on grassmann manifold with application to video based face recognition. In: International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 140–149 (2015)

    Google Scholar 

  6. Roweis, S.T., Saul, L.K.: Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500), 2323–2326 (2000)

    Article  Google Scholar 

  7. He, X., Yan, S., et al.: Face recognition using laplacianfaces. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 27, 328–340 (2005)

    Article  Google Scholar 

  8. Zeng, X., Luo, S.-W.: A supervised subspace learning algorithm: supervised neighborhood preserving embedding. In: Alhajj, R., Gao, H., Li, X., Li, J., Zaïane, O.R. (eds.) ADMA 2007. LNCS (LNAI), vol. 4632, pp. 81–88. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  9. Cai, D., He, X., Zhou, K., Han, J., Bao, H.: Locality sensitive discriminant analysis. In: IJCAI, pp. 708–713 (2007)

    Google Scholar 

  10. Yan, S., Xu, D., Zhang, B., Zhang, H.J., Yang, Q., Lin, S.: Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 29(1), 40–51 (2007)

    Article  Google Scholar 

  11. Wang, R., Chen, X.: Manifold discriminant analysis. In: International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 429–436 (2009)

    Google Scholar 

  12. Wang, R., Shan, S., Chen, X., Chen, J., Gao, W.: Maximal linear embedding for dimensionality reduction. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 33(9), 1776–1792 (2011)

    Article  Google Scholar 

  13. Wang, B.H., Lin, C., et al.: Neighbourhood sensitive preserving embedding for pattern classification. IET Image Proc. 8(8), 489–497 (2014)

    Article  Google Scholar 

  14. Olshausen, B.A., Field, D.J.: Sparse coding with an over-complete basis set: a strategy employed by V1? Vision. Res. 37(23), 3311–3325 (1997)

    Article  Google Scholar 

  15. Kim, S.J., Koh, K., Lustig, M., Boyd, S., Gorinevsky, D.: An interior-point method for large-scale l1-regularized least squares. IEEE J. Sel. Top. Signal Process. 1(4), 606–617 (2007)

    Article  Google Scholar 

  16. Aharon, M., Elad, M., Bruckstein, A.: The K-SVD: an algorithm for designing of overcomplete dictionaries for sparse representation. IEEE Trans. Image Process. (TIP) 54(11), 4311–4322 (2006)

    Article  Google Scholar 

  17. Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 31(2), 210–227 (2009)

    Article  Google Scholar 

  18. Zheng, M., Bu, J., Chen, C., Wang, C., Zhang, L., Qiu, G., Cai, D.: Graph regularized sparse coding for image representation. IEEE Trans. Image Process. (TIP) 20(5), 1327–1336 (2011)

    Article  MathSciNet  Google Scholar 

  19. Yang, M., Zhang, L., et al.: Metaface learning for sparse representation. In: International Conference on Image Processing (ICIP), pp. 1601–1604 (2010)

    Google Scholar 

  20. Gao, S., Tsang, I. W. H., Chia, L. T., Zhao, P.: Local features are not lonely-laplacian sparse coding for image classification. In: Computer Vision and Pattern Recognition (CVPR), pp. 3555–3561 (2010)

    Google Scholar 

  21. Wang, B., Pang, M., Lin, C., Fan, X.: Graph regularized non-negative matrix factorization with sparse coding. In: IEEE China Summit & International Conference On Signal and Information Processing (ChinaSIP), pp. 476–480 (2013)

    Google Scholar 

  22. Zhang, L., Zhu, P., et al.: A linear subspace learning approach via sparse coding. In: IEEE International Conference on Computer Vision (ICCV), pp. 755–761 (2011)

    Google Scholar 

  23. Li, H., Jiang, T., Zhang, K.: Efficient and robust feature extraction by maximum margin criterion. IEEE Trans. Neural Netw. 17(1), 157–165 (2006)

    Article  Google Scholar 

  24. Georghiades, A.S., et al.: From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 23(6), 643–660 (2001)

    Article  Google Scholar 

  25. Sim, T., Baker, S., Bsat, M.: The CMU Pose Illumination, and Expression (PIE) database. In: FG, pp. 46–51 (2002)

    Google Scholar 

  26. Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)

    Article  Google Scholar 

  27. Cai, D., He, X., Han, J., Huang, T.S.: Graph regularized non-negative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 33(8), 1548–1560 (2011)

    Article  Google Scholar 

  28. Nene, S. A., Nayar, S. K., Murase, H.: Columbia object image library (coil-20). Technical Report CUCS-005-96 (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chuang Lin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Pang, M., Wang, B., Fan, X., Lin, C. (2016). Discriminant Manifold Learning via Sparse Coding for Image Analysis. In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science(), vol 9517. Springer, Cham. https://doi.org/10.1007/978-3-319-27674-8_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27674-8_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27673-1

  • Online ISBN: 978-3-319-27674-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics