Skip to main content
Log in

Tensor-driven low-rank discriminant analysis for image set classification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Classification based on image sets has recently attracted great interest in computer vision community. In this paper, we proposed a transductive Tensor-driven Low-rank Discriminant Analysis (TLRDA) model for image set classification, in which the tensor-driven low-rank approximation and the discriminant graph embedding are integrated to improve the representativeness of image sets. In addition, we develop an iterative shrinkage thresholding algorithm to better optimize the objective function of the proposed TLRDA. Experiments on seven publicly available datasets demonstrate that our proposed method is guaranteed to converge within a small number of iterations during the training procedure and obtains promising results compared with state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Belkin M, Niyogi P (2002) Laplacian eigenmaps and spectral techniques for embedding and clustering. Proc Adv Neural Inf Process Syst 14(6):585–591

    Google Scholar 

  2. Brenner C, Inbar Y (2015) Disgust sensitivity predicts political ideology and policy attitudes in the Netherlands. Eur J Soc Psychol 45(1):27–38

    Article  Google Scholar 

  3. Cai D (2009) Spectral regression: a regression framework for efficient regularized subspace learning. University of Illinois, Urbana-Champaign

    Google Scholar 

  4. Cai D, He X, Han J (2007) Spectral regression for efficient regularized subspace learning. In: Proceedings of IEEE International Conference on Computer Vision, pp 1–8

  5. Cevikalp H, Triggs B (2010) Face recognition based on image sets. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 2567–2573

  6. Cui J, Liu Y, Xu Y, Zhao H, Zha H (2013) Tracking generic human motion via fusion of low- and high-dimensional approaches. IEEE Trans Syst Man Cybern Syst 43(4):996–1002

    Article  Google Scholar 

  7. Ding Z, Fu Y (2016) Robust multi-view subspace learning through dual low-rank decompositions. In: Proceedings of AAAI Conference on Artificial Intelligence, pp 1181–1187

  8. Deng C, He X, Han J (2007) SRDA: An efficient algorithm for large scale discriminant analysis. IEEE Trans Knowl Data Eng 20(1):1–12

    Article  Google Scholar 

  9. Ding Z, Fu Y, Low-rank common subspace for multi-view learning (2014). In: Proceedings of IEEE International Conference on Data Mining, pp 110–119

  10. Dou J, Li J, Qin Q, Tu Z (2015) Moving object detection based on incremental learning low rank representation and spatial constraint. Neurocomputing 168:382–400

    Article  Google Scholar 

  11. Dong W, Li G, Shi G, Li X, Ma Y (2015) Low-rank tensor approximation with laplacian scale mixture modeling for multiframe image denoising. In: Proceedings of the IEEE International Conference on Computer Vision, pp 442–449

  12. Faraki M, Harandi M, Porikli F (2010) Image set classification by symmetric positive semi-definite matrices. In: Proceedings of IEEE Winter Conference on Applications of Computer Vision, pp 1–8

  13. Gross R, Shi J (2001) The cmu motion of body (mobo) database. Technical report, Carnegie Mellon University, Carnegie

    Google Scholar 

  14. Hamm J, Lee D (2008) Grassman discriminant analysis: a unifying view on subspace-based learning. In: Proceedings of International Conference on Machine Learning, pp 376–383

  15. Hardoon D, Szedmak S, Shawe-Taylor J (2004) Canonical correlation analysis: An overview with application to learning methods. Neural Comput 16(12):2639–2664

    Article  MATH  Google Scholar 

  16. Harandi M, Sanderson C, Shirazi S, Lovell B (2011) Graph embedding discriminant analysis on grassmannian manifolds for improved image set matching. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 2705–2712

  17. Harandi M, Salzmann M, Baktashmotlagh M (2015) Beyond gauss: image-set matching on the riemannian manifold of PDFs. In: Proceedings of International Conference on Computer Vision, pp 4112–4120

  18. He X, Niyogi P (2002) Locality preserving projections. Adv Neural Inf Process Syst 16(1):186–197

    Google Scholar 

  19. Hu Y, Mian A, Owens R (2011) Sparse approximated nearest points for image set classification. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 121–128

  20. Hu W, Tao D, Zhang W, Xie Y, Yang Y (2015) A new low-rank tensor model for video completion. CoRR arXiv:1509.02027

  21. Huang Z, Wang R, Shan S, Li X, Chen X (2015) Log-euclidean metric learning on symmetric positive definite manifold with application to image set classification, pp 720–729

  22. Jia C, Zhong G, Fu Y (2014) Low-rank tensor learning with discriminant analysis for action classification and image recovery. In: Proceedings of AAAI Conference on Artificial Intelligence, pp 1228–1234

  23. Kim T, Cipolla R (2009) Canonical correlation analysis of video volume tensors for action categorization and detection. IEEE Trans Pattern Anal Mach Intell 31 (8):1415–1428

    Article  Google Scholar 

  24. Kim T, Kittler J, Cipolla R (2006) Learning discriminative canonical correlations for object recognition with image sets. In: Proceedings of European Conference on Computer Vision, pp 251–262

  25. Kroonenberg P, Leeuw J (1980) Principal component analysis of three-mode data by means of alternating least squares algorithms. Psychometrika 45:69–97

    Article  MathSciNet  MATH  Google Scholar 

  26. Lee K, Ho J, Yang M, Kriegman D (2003) Video-based face recognition using probabilistic appearance manifolds. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 313–320

  27. Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324

    Article  Google Scholar 

  28. Leibe B, Schiele B (2003) Analyzing appearance and contour based methods for object categorization. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 409–415

  29. Li J, Wu Y, Zhao J, Lu K (2016) Low-rank discriminant embedding for multiview learning. IEEE Trans Cybern 99:1–14

    Google Scholar 

  30. Lui Y, Beveridge J, Kirby M (2010) Action classification on product manifolds. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 833–839

  31. Liu G, Lin Z, Yu Y (2010) Robust subspace segmentation by low-rank representation. In: Proceedings of International Conference on Machine Learning, pp 663–670

  32. Liu Y, Cui J, Zhao H, Zha H (2012) Fusion of low-and high-dimensional approaches by trackers sampling for generic human motion tracking. In: Proceedings of International Conference on Pattern Recognition, pp 898–901

  33. Liu Y, Zhang L, Nie L, Yan Y, Rosenblum D (2016) Fortune teller: predicting your career path. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 201–207

  34. Liu Y, Nie L, Liu L, Rosenblum D (2016) From action to activity: sensor-based activity recognition. Neurocomputing 181:108–115

    Article  Google Scholar 

  35. Liu Y, Zheng Y, Liang Y, Liu S, Rosenblum D (2016) Urban water quality prediction based on multi-task multi-view learning. In: Proceedings of the 25th International Conference on Artificial Intelligence

  36. Liu Y, Liang Y, Liu S, Rosenblum D, Zheng Y (2016) Predicting urban water quality with ubiquitous data. arXiv:1610.09462

  37. Liu L, Cheng L, Liu Y, Jia Y, Rosenblum D (2016) Recognizing complex activities by a probabilistic interval-based model. In: Proceedings of AAAI Conference on Artificial Intelligence, pp 1266–1272

  38. Liu Y, Nie L, Han L, Zhang L, Rosenblum D (2016) Action2Activity: recognizing complex activities from sensor data. In: Proceedings of International Conference on Artificial Intelligence, pp 1617–1623

  39. Lu H (2013) Learning canonical correlations of paired tensor sets via tensor-to-vector projection. In: Proceedings of International Joint Conference on Artificial Intelligence, pp 3–9

  40. Lu H, Plataniotis K, Venetsanopoulos A (2009) Uncorrelated multilinear discriminant analysis with regularization and aggregation for tensor object recognition. IEEE Trans Neural Netw 20(1):103–123

    Article  Google Scholar 

  41. Lu Y, Wei Y, Liu L, Zhong J, Sun L, Liu Y (2016) Towards unsupervised physical activity recognition using smartphone accelerometers. Multimed Tools Appl, pp 1–19

  42. Manton J, Mahony R, Hua Y (2003) The geometry of weighted low-rank approximations. IEEE Trans Signal Process 51(2):500–514

    Article  MathSciNet  MATH  Google Scholar 

  43. Nguyen H, Yang W, Shen F, Sun C (2015) Kernel low-rank representation for face recognition. Neurocomputing 155:32–42

    Article  Google Scholar 

  44. Pietro P, Liu Daniel Y, Hopkins D, Ungar L (2017) Beyond binary labels: Political ideology prediction of twitter users in ACL

  45. Prooijen J, Krouwel A, Boiten M, Eendebak L (2015) Fear among the extremes: how political ideology predicts negative emotions and outgroup derogation. Pers Soc Psychol Bullet 41(4):485–497

    Article  Google Scholar 

  46. Rate C, Retrieval C (2011) Columbia Object Image Library (COIL-20). Tech. Rep. CUCS-005-96. Columbia University, New York

  47. Rodriguez-Aseretto D, Rigo D, Leo M, Corts A, San-Miguel-Ayanz J (2013) A data-driven model for large wildfire behaviour prediction in europe. Procedia Comput Sci 18:1861–1870

    Article  Google Scholar 

  48. Rowei S, Saul L (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326

    Article  Google Scholar 

  49. Shroff N, Turaga P, Chellappa R (2010) Moving vistas: exploiting motion for describing scenes. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 1911–1918

  50. Su Y, Wang H, Jing P, Xu C (2015) A spatial-temporal iterative tensor decomposition technique for action and gesture recognition. Multimed Tools Appl,pp 1–18

  51. Tao D, Li X, Wu X, Maybank S (2008) Tensor rank one discriminant analysis-A convergent method for discriminative multilinear subspace selection. Neurocomputing 71:1866–1882

    Article  Google Scholar 

  52. Turaga P, Veeraraghavan A, Srivastava A, Chellappa R (2011) Statistical computations on Grassmann and Stiefel manifolds for image and video-based recognition. IEEE Trans Pattern Anal Mach Intell 33(11):2273–2286

    Article  Google Scholar 

  53. Wang R, Chen X (2009) Manifold discriminant analysis. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 429–436

  54. Wang T, Shi P (2009) Kernel grassmannian distances and discriminant analysis for face recognition from image sets. Pattern Recogn Lett 30(13):1161–1165

    Article  Google Scholar 

  55. Wang R, Shan S, Chen X, Gao W (2008) Manifold-manifold distance with application to face recognition based on image set. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 2940–2947

  56. Wang Y, Xu H, Leng C (2013) Provable subspace clustering: When LRR meets SSC. In: Proceedigns of Advances in Neural Information Processing Systems, pp 64–72

  57. Wang B, Hu Y, Gao J, Sun Y, Yin B (2014) Low rank representation on Grassmann manifolds. In: Proceedings of Asian Conference on Computer Vision, pp 81–96

  58. Wang G, Zheng F, Shi C, Xue J, Liu C, He L (2015) Embedding metric learning into set-based face recognition for video surveillance. Neurocomputing 151:1500–1506

    Article  Google Scholar 

  59. Wang J, Shi D, Cheng D, Zhang Y, Gao J (2016) LRSR: Low-rank-sparse representation for subspace clustering. Neurocomputing 214:1026–1037

    Article  Google Scholar 

  60. Wang B, Hu Y, Gao J, Sun Y, Yin B (2016) Product grassmann manifold representation and its lrr models. In: Proceedings of AAAI Conference on Artificial Intelligence, pp 2122–2129

  61. Welch G, Foxlin E (2002) Motion tracking: no silver bullet, but a respectable arsenal. IEEE Comput Graph Appl 22(6):24–38

    Article  Google Scholar 

  62. Wright J, Beaver K, Morgan M, Connolly E (2016) Political ideology predicts involvement in crime. Personality and Individual Differences

  63. Yamaguchi O, Fukui K, Maeda K (1998) Face recognition using temporal image sequence. In: Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, pp 318– 323

  64. Yun X, Bachmann E (2006) Design, implementation, and experimental results of a quaternion-based kalman filter for human body motion tracking. IEEE Trans Robot 22(6):1216–1227

    Article  Google Scholar 

  65. Zhang J, Xu C, Jing P, Zhang C, Su Y (2016) A tensor-driven temporal correlation model for video sequence classification. IEEE Signal Process Lett 23(9):1246–1249

    Article  Google Scholar 

  66. Zheng C, Hou Y, Zhang J (2016) Im proved sparse representation with low-rank representation for robust face recognition. Neurocomputing 198:114–124

    Article  Google Scholar 

  67. Zhong G, Cheriet M (2014) Large margin low rank tensor analysis. Neural Comput 26(4):761–780

    Article  MathSciNet  Google Scholar 

  68. Zhou B, Lapedriza A, Xiao J, Torralba A, Oliva A (2014) Learning deep features for scene recognition using places database. In: Proceedings of Advances in Neural Information Processing Systems, pp 487–495

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Peiguang Jing.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, J., Li, Z., Jing, P. et al. Tensor-driven low-rank discriminant analysis for image set classification. Multimed Tools Appl 78, 4001–4020 (2019). https://doi.org/10.1007/s11042-017-5173-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-017-5173-0

Keywords

Navigation