Abstract
Classification based on image sets has recently attracted great interest in computer vision community. In this paper, we proposed a transductive Tensor-driven Low-rank Discriminant Analysis (TLRDA) model for image set classification, in which the tensor-driven low-rank approximation and the discriminant graph embedding are integrated to improve the representativeness of image sets. In addition, we develop an iterative shrinkage thresholding algorithm to better optimize the objective function of the proposed TLRDA. Experiments on seven publicly available datasets demonstrate that our proposed method is guaranteed to converge within a small number of iterations during the training procedure and obtains promising results compared with state-of-the-art methods.
Similar content being viewed by others
References
Belkin M, Niyogi P (2002) Laplacian eigenmaps and spectral techniques for embedding and clustering. Proc Adv Neural Inf Process Syst 14(6):585–591
Brenner C, Inbar Y (2015) Disgust sensitivity predicts political ideology and policy attitudes in the Netherlands. Eur J Soc Psychol 45(1):27–38
Cai D (2009) Spectral regression: a regression framework for efficient regularized subspace learning. University of Illinois, Urbana-Champaign
Cai D, He X, Han J (2007) Spectral regression for efficient regularized subspace learning. In: Proceedings of IEEE International Conference on Computer Vision, pp 1–8
Cevikalp H, Triggs B (2010) Face recognition based on image sets. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 2567–2573
Cui J, Liu Y, Xu Y, Zhao H, Zha H (2013) Tracking generic human motion via fusion of low- and high-dimensional approaches. IEEE Trans Syst Man Cybern Syst 43(4):996–1002
Ding Z, Fu Y (2016) Robust multi-view subspace learning through dual low-rank decompositions. In: Proceedings of AAAI Conference on Artificial Intelligence, pp 1181–1187
Deng C, He X, Han J (2007) SRDA: An efficient algorithm for large scale discriminant analysis. IEEE Trans Knowl Data Eng 20(1):1–12
Ding Z, Fu Y, Low-rank common subspace for multi-view learning (2014). In: Proceedings of IEEE International Conference on Data Mining, pp 110–119
Dou J, Li J, Qin Q, Tu Z (2015) Moving object detection based on incremental learning low rank representation and spatial constraint. Neurocomputing 168:382–400
Dong W, Li G, Shi G, Li X, Ma Y (2015) Low-rank tensor approximation with laplacian scale mixture modeling for multiframe image denoising. In: Proceedings of the IEEE International Conference on Computer Vision, pp 442–449
Faraki M, Harandi M, Porikli F (2010) Image set classification by symmetric positive semi-definite matrices. In: Proceedings of IEEE Winter Conference on Applications of Computer Vision, pp 1–8
Gross R, Shi J (2001) The cmu motion of body (mobo) database. Technical report, Carnegie Mellon University, Carnegie
Hamm J, Lee D (2008) Grassman discriminant analysis: a unifying view on subspace-based learning. In: Proceedings of International Conference on Machine Learning, pp 376–383
Hardoon D, Szedmak S, Shawe-Taylor J (2004) Canonical correlation analysis: An overview with application to learning methods. Neural Comput 16(12):2639–2664
Harandi M, Sanderson C, Shirazi S, Lovell B (2011) Graph embedding discriminant analysis on grassmannian manifolds for improved image set matching. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 2705–2712
Harandi M, Salzmann M, Baktashmotlagh M (2015) Beyond gauss: image-set matching on the riemannian manifold of PDFs. In: Proceedings of International Conference on Computer Vision, pp 4112–4120
He X, Niyogi P (2002) Locality preserving projections. Adv Neural Inf Process Syst 16(1):186–197
Hu Y, Mian A, Owens R (2011) Sparse approximated nearest points for image set classification. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 121–128
Hu W, Tao D, Zhang W, Xie Y, Yang Y (2015) A new low-rank tensor model for video completion. CoRR arXiv:1509.02027
Huang Z, Wang R, Shan S, Li X, Chen X (2015) Log-euclidean metric learning on symmetric positive definite manifold with application to image set classification, pp 720–729
Jia C, Zhong G, Fu Y (2014) Low-rank tensor learning with discriminant analysis for action classification and image recovery. In: Proceedings of AAAI Conference on Artificial Intelligence, pp 1228–1234
Kim T, Cipolla R (2009) Canonical correlation analysis of video volume tensors for action categorization and detection. IEEE Trans Pattern Anal Mach Intell 31 (8):1415–1428
Kim T, Kittler J, Cipolla R (2006) Learning discriminative canonical correlations for object recognition with image sets. In: Proceedings of European Conference on Computer Vision, pp 251–262
Kroonenberg P, Leeuw J (1980) Principal component analysis of three-mode data by means of alternating least squares algorithms. Psychometrika 45:69–97
Lee K, Ho J, Yang M, Kriegman D (2003) Video-based face recognition using probabilistic appearance manifolds. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 313–320
Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324
Leibe B, Schiele B (2003) Analyzing appearance and contour based methods for object categorization. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 409–415
Li J, Wu Y, Zhao J, Lu K (2016) Low-rank discriminant embedding for multiview learning. IEEE Trans Cybern 99:1–14
Lui Y, Beveridge J, Kirby M (2010) Action classification on product manifolds. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 833–839
Liu G, Lin Z, Yu Y (2010) Robust subspace segmentation by low-rank representation. In: Proceedings of International Conference on Machine Learning, pp 663–670
Liu Y, Cui J, Zhao H, Zha H (2012) Fusion of low-and high-dimensional approaches by trackers sampling for generic human motion tracking. In: Proceedings of International Conference on Pattern Recognition, pp 898–901
Liu Y, Zhang L, Nie L, Yan Y, Rosenblum D (2016) Fortune teller: predicting your career path. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 201–207
Liu Y, Nie L, Liu L, Rosenblum D (2016) From action to activity: sensor-based activity recognition. Neurocomputing 181:108–115
Liu Y, Zheng Y, Liang Y, Liu S, Rosenblum D (2016) Urban water quality prediction based on multi-task multi-view learning. In: Proceedings of the 25th International Conference on Artificial Intelligence
Liu Y, Liang Y, Liu S, Rosenblum D, Zheng Y (2016) Predicting urban water quality with ubiquitous data. arXiv:1610.09462
Liu L, Cheng L, Liu Y, Jia Y, Rosenblum D (2016) Recognizing complex activities by a probabilistic interval-based model. In: Proceedings of AAAI Conference on Artificial Intelligence, pp 1266–1272
Liu Y, Nie L, Han L, Zhang L, Rosenblum D (2016) Action2Activity: recognizing complex activities from sensor data. In: Proceedings of International Conference on Artificial Intelligence, pp 1617–1623
Lu H (2013) Learning canonical correlations of paired tensor sets via tensor-to-vector projection. In: Proceedings of International Joint Conference on Artificial Intelligence, pp 3–9
Lu H, Plataniotis K, Venetsanopoulos A (2009) Uncorrelated multilinear discriminant analysis with regularization and aggregation for tensor object recognition. IEEE Trans Neural Netw 20(1):103–123
Lu Y, Wei Y, Liu L, Zhong J, Sun L, Liu Y (2016) Towards unsupervised physical activity recognition using smartphone accelerometers. Multimed Tools Appl, pp 1–19
Manton J, Mahony R, Hua Y (2003) The geometry of weighted low-rank approximations. IEEE Trans Signal Process 51(2):500–514
Nguyen H, Yang W, Shen F, Sun C (2015) Kernel low-rank representation for face recognition. Neurocomputing 155:32–42
Pietro P, Liu Daniel Y, Hopkins D, Ungar L (2017) Beyond binary labels: Political ideology prediction of twitter users in ACL
Prooijen J, Krouwel A, Boiten M, Eendebak L (2015) Fear among the extremes: how political ideology predicts negative emotions and outgroup derogation. Pers Soc Psychol Bullet 41(4):485–497
Rate C, Retrieval C (2011) Columbia Object Image Library (COIL-20). Tech. Rep. CUCS-005-96. Columbia University, New York
Rodriguez-Aseretto D, Rigo D, Leo M, Corts A, San-Miguel-Ayanz J (2013) A data-driven model for large wildfire behaviour prediction in europe. Procedia Comput Sci 18:1861–1870
Rowei S, Saul L (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326
Shroff N, Turaga P, Chellappa R (2010) Moving vistas: exploiting motion for describing scenes. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 1911–1918
Su Y, Wang H, Jing P, Xu C (2015) A spatial-temporal iterative tensor decomposition technique for action and gesture recognition. Multimed Tools Appl,pp 1–18
Tao D, Li X, Wu X, Maybank S (2008) Tensor rank one discriminant analysis-A convergent method for discriminative multilinear subspace selection. Neurocomputing 71:1866–1882
Turaga P, Veeraraghavan A, Srivastava A, Chellappa R (2011) Statistical computations on Grassmann and Stiefel manifolds for image and video-based recognition. IEEE Trans Pattern Anal Mach Intell 33(11):2273–2286
Wang R, Chen X (2009) Manifold discriminant analysis. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 429–436
Wang T, Shi P (2009) Kernel grassmannian distances and discriminant analysis for face recognition from image sets. Pattern Recogn Lett 30(13):1161–1165
Wang R, Shan S, Chen X, Gao W (2008) Manifold-manifold distance with application to face recognition based on image set. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp 2940–2947
Wang Y, Xu H, Leng C (2013) Provable subspace clustering: When LRR meets SSC. In: Proceedigns of Advances in Neural Information Processing Systems, pp 64–72
Wang B, Hu Y, Gao J, Sun Y, Yin B (2014) Low rank representation on Grassmann manifolds. In: Proceedings of Asian Conference on Computer Vision, pp 81–96
Wang G, Zheng F, Shi C, Xue J, Liu C, He L (2015) Embedding metric learning into set-based face recognition for video surveillance. Neurocomputing 151:1500–1506
Wang J, Shi D, Cheng D, Zhang Y, Gao J (2016) LRSR: Low-rank-sparse representation for subspace clustering. Neurocomputing 214:1026–1037
Wang B, Hu Y, Gao J, Sun Y, Yin B (2016) Product grassmann manifold representation and its lrr models. In: Proceedings of AAAI Conference on Artificial Intelligence, pp 2122–2129
Welch G, Foxlin E (2002) Motion tracking: no silver bullet, but a respectable arsenal. IEEE Comput Graph Appl 22(6):24–38
Wright J, Beaver K, Morgan M, Connolly E (2016) Political ideology predicts involvement in crime. Personality and Individual Differences
Yamaguchi O, Fukui K, Maeda K (1998) Face recognition using temporal image sequence. In: Proceedings of IEEE International Conference on Automatic Face and Gesture Recognition, pp 318– 323
Yun X, Bachmann E (2006) Design, implementation, and experimental results of a quaternion-based kalman filter for human body motion tracking. IEEE Trans Robot 22(6):1216–1227
Zhang J, Xu C, Jing P, Zhang C, Su Y (2016) A tensor-driven temporal correlation model for video sequence classification. IEEE Signal Process Lett 23(9):1246–1249
Zheng C, Hou Y, Zhang J (2016) Im proved sparse representation with low-rank representation for robust face recognition. Neurocomputing 198:114–124
Zhong G, Cheriet M (2014) Large margin low rank tensor analysis. Neural Comput 26(4):761–780
Zhou B, Lapedriza A, Xiao J, Torralba A, Oliva A (2014) Learning deep features for scene recognition using places database. In: Proceedings of Advances in Neural Information Processing Systems, pp 487–495
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, J., Li, Z., Jing, P. et al. Tensor-driven low-rank discriminant analysis for image set classification. Multimed Tools Appl 78, 4001–4020 (2019). https://doi.org/10.1007/s11042-017-5173-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-017-5173-0