A loss combination based deep model for person re-identification

Zhu, Fuqing; Kong, Xiangwei; Wu, Qun; Fu, Haiyan; Li, Ming

doi:10.1007/s11042-017-5009-y

Published: 08 July 2017

Volume 77, pages 3049–3069, (2018)
Multimedia Tools and Applications

Fuqing Zhu¹,
Xiangwei Kong¹ (ORCID: orcid.org/0000-0002-0851-6752),
Qun Wu²,³,
Haiyan Fu¹ &
Ming Li¹

Abstract

Convolutional Neural Networks (CNNs) have significantly advanced the state of the art in person re-identification (re-ID). Existing identification CNN models are trained with the softmax loss as the supervision signal. However, the softmax loss only encourages separability of the learned deep features between different identities; intra-class variations are not considered during training. To minimize intra-class variations and thereby improve the discriminative ability of the CNN model, this paper combines a new supervision signal with the original softmax loss for person re-ID. Specifically, during training, a center of the deep features is learned for each pedestrian identity and, simultaneously, each deep feature is compared with (subtracted from) its corresponding identity center, so that the deep features of the same identity are pulled toward their center. With this combination of loss functions, inter-class dispersion and intra-class aggregation are encouraged jointly. In this way, a more discriminative CNN model with two key learning objectives can be learned to extract deep features for the person re-ID task. We evaluate our method with two identification CNN models (CaffeNet and ResNet-50). The method yields a stable improvement over the baseline and performance competitive with state-of-the-art person re-ID methods on three important person re-ID benchmarks (Market-1501, CUHK03 and MARS).
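The loss combination described in the abstract can be illustrated with a minimal sketch. It assumes that the new supervision signal takes the form of the center loss of Wen et al. (2016), which appears in the reference list; the abstract itself does not give the formula. The function names, the balancing weight `lam` and the center learning rate `alpha` are hypothetical and chosen only for illustration.

```python
import math

def softmax_loss(logits, label):
    """Cross-entropy of one sample, computed from raw class scores."""
    m = max(logits)  # subtract the maximum for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def center_loss(feature, center):
    """Half the squared Euclidean distance between a feature and its identity center."""
    return 0.5 * sum((f - c) ** 2 for f, c in zip(feature, center))

def combined_loss(logits, features, labels, centers, lam=0.5):
    """Batch mean of softmax loss + lam * center loss (lam is an assumed weight)."""
    total = 0.0
    for z, x, y in zip(logits, features, labels):
        total += softmax_loss(z, y) + lam * center_loss(x, centers[y])
    return total / len(labels)

def update_centers(features, labels, centers, alpha=0.5):
    """Move each identity center toward the features of that identity in the batch."""
    new_centers = [list(c) for c in centers]
    for y in set(labels):
        members = [x for x, l in zip(features, labels) if l == y]
        for d in range(len(centers[y])):
            # Averaged difference between the center and its member features;
            # the +1 in the denominator damps updates from very small groups.
            delta = sum(centers[y][d] - x[d] for x in members) / (1 + len(members))
            new_centers[y][d] = centers[y][d] - alpha * delta
    return new_centers
```

In this sketch the softmax term separates identities while the center term shrinks the spread of each identity around its center; setting `lam=0` recovers plain softmax training.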

Notes

Here we do not describe the identification CNN model in detail; Fig. 2 simply takes the CaffeNet [25] model as an example.
The input size is 227 × 227 for CaffeNet [25] and 224 × 224 for ResNet-50 [19].
Note: the layer is FC7 for CaffeNet [25] and Pool5 for ResNet-50 [19].

References

Ahmed E, Jones M, Marks TK (2015) An improved deep learning architecture for person re-identification. In: Proceedings CVPR, pp 3908–3916
An L, Chen X, Liu S, Lei Y, Yang S (2016) Integrating appearance features and soft biometrics for person re-identification. Multimedia Tools and Applications
Baltieri D, Vezzani R, Cucchiara R (2011) 3dpes: 3d people dataset for surveillance and forensics. In: Proceedings ACM workshop on human gesture and behavior understanding, pp 59–64
Bottou L (2010) Large-scale machine learning with stochastic gradient descent. In: Proceedings COMPSTAT’2010, pp 177–186
Chang X, Yang Y (2016) Semisupervised feature analysis by mining correlations among multiple tasks. IEEE Trans Neural Netw Learn Syst. doi:10.1109/TNNLS.2016.2582746
Chang X, Nie F, Wang S, Yang Y, Zhou X, Zhang C (2016) Compound rank-k projections for bilinear analysis. IEEE Trans Neural Netw Learn Syst 27(7):1502–1513
Chang X, Nie F, Yang Y, Zhang C, Huang H (2016) Convex sparse pca for unsupervised feature learning. ACM Trans Knowl Discov Data 11(1):3:1–3:16
Chang X, Yu YL, Yang Y, Xing EP (2016) Semantic pooling for complex event analysis in untrimmed videos. IEEE Trans Pattern Anal Mach Intell. doi:10.1109/TPAMI.2016.2608901
Chen D, Yuan Z, Chen B, Zheng N (2016) Similarity learning with spatial constraints for person re-identification. In: Proceedings CVPR, pp 1268–1277
Cheng D, Gong Y, Zhou S, Wang J, Zheng N (2016) Person re-identification by multi-channel parts-based cnn with improved triplet loss function. In: Proceedings CVPR, pp 1335–1344
Das A, Chakraborty A, Roy-Chowdhury AK (2014) Consistent re-identification in a camera network. In: Proceedings ECCV, pp 330–345
Davis JV, Kulis B, Jain P, Sra S, Dhillon IS (2007) Information-theoretic metric learning. In: Proceedings ICML, pp 209–216
Dehghan A, Modiri Assari S, Shah M (2015) Gmmcp tracker: Globally optimal generalized maximum multi clique problem for multiple object tracking. In: Proceedings CVPR, pp 4091–4099
Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: Proceedings CVPR, pp 248–255
Ess A, Leibe B, Van Gool L (2007) Depth and appearance for mobile scene analysis. In: Proceedings ICCV, pp 1–8
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings CVPR, pp 580–587
Gray D, Tao H (2008) Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: Proceedings ECCV, pp 262–275
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings CVPR, pp 770–778
Hirzer M, Beleznai C, Roth PM, Bischof H (2011) Person re-identification by descriptive and discriminative classification. In: Proceedings Scandinavian conference on Image analysis, pp 91–102
Hu HM, Fang W, Zeng G, Hu Z, Li B (2016) A person re-identification algorithm based on pyramid color topology feature. Multimedia Tools and Applications
Jia Y, Shelhamer E, Donahue J, Karayev S, Long J, Girshick R, Guadarrama S, Darrell T (2014) Caffe: Convolutional architecture for fast feature embedding. In: Proceedings ACM international conference on multimedia, pp 675–678
Jose C, Fleuret F (2016) Scalable metric learning via weighted approximate rank component analysis. In: Proceedings ECCV, pp 875–890
Koestinger M, Hirzer M, Wohlhart P, Roth PM, Bischof H (2012) Large scale metric learning from equivalence constraints. In: Proceedings CVPR, pp 2288–2295
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Proceedings NIPS, pp 1097–1105
Leng Q, Hu R, Liang C, Wang Y, Chen J (2015) Person re-identification with content and context re-ranking. Multimedia Tools Appl 74(17):6989–7014
Li W, Wang X (2013) Locally aligned feature transforms across views. In: Proceedings CVPR, pp 3594–3601
Li W, Zhao R, Wang X (2012) Human reidentification with transferred metric learning. In: Proceedings ACCV, pp 31–44
Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: deep filter pairing neural network for person re-identification. In: Proceedings CVPR, pp 152–159
Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. In: Proceedings CVPR, pp 2197–2206
Lin Y, Zheng L, Zheng Z, Wu Y, Yang Y (2017) Improving person re-identification by attribute and identity learning. arXiv:1703.07220
Liu H, Feng J, Qi M, Jiang J, Yan S (2016) End-to-end comparative attention networks for person re-identification. arXiv:1606.04404
Liu J, Zha ZJ, Tian Q, Liu D, Yao T, Ling Q, Mei T (2016) Multi-scale triplet cnn for person re-identification. In: Proceedings ACM international conference on multimedia, pp 192–196
Martinel N, Das A, Micheloni C, Roy-Chowdhury AK (2016) Temporal model adaptation for person re-identification. arXiv:1607.07216
Prosser B, Zheng WS, Gong S, Xiang T, Mary Q (2010) Person re-identification by support vector ranking. In: Proceedings BMVC, pp 1–11
Radenović F, Tolias G, Chum O (2016) Cnn image retrieval learns from bow: unsupervised fine-tuning with hard examples. arXiv:1604.02426
Radford A, Metz L, Chintala S (2015) Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv:1511.06434
Roth PM, Hirzer M, Koestinger M, Beleznai C, Bischof H (2014) Mahalanobis distance learning for person re-identification. In: Person re-identification, Springer London, pp 247–267
Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. In: Proceedings CVPR, pp 815–823
Su C, Zhang S, Xing J, Gao W, Tian Q (2016) Deep attributes driven multi-camera person re-identification. In: Proceedings ECCV, pp 475–491
Sun Y, Zheng L, Deng W, Wang S (2017) Svdnet for pedestrian retrieval. arXiv:1703.05693
Ustinova E, Ganin Y, Lempitsky V (2015) Multiregion bilinear convolutional neural networks for person re-identification. arXiv:1512.05300
Varior RR, Haloi M, Wang G (2016) Gated siamese convolutional neural network architecture for human re-identification. In: Proceedings ECCV, pp 791–808
Varior RR, Shuai B, Lu J, Xu D, Wang G (2016) A siamese long short-term memory architecture for human re-identification. In: Proceedings ECCV, pp 135–153
Wang F, Zuo W, Lin L, Zhang D, Zhang L (2016) Joint learning of single-image and cross-image representations for person re-identification. In: Proceedings CVPR, pp 1288–1296
Wang T, Gong S, Zhu X, Wang S (2014) Person re-identification by video ranking. In: Proceedings ECCV, pp 688–703
Weinberger KQ, Blitzer J, Saul LK (2005) Distance metric learning for large margin nearest neighbor classification. In: Proceedings NIPS, pp 1473–1480
Wen Y, Zhang K, Li Z, Qiao Y (2016) A discriminative feature learning approach for deep face recognition. In: Proceedings ECCV, pp 499–515
Wu L, Shen C, Hengel AvD (2016) Personnet: Person re-identification with deep convolutional neural networks. arXiv:1601.07255
Xiang ZJ, Chen Q, Liu Y (2014) Person re-identification by fuzzy space color histogram. Multimedia Tools Appl 73(1):91–107
Xiao T, Li H, Ouyang W, Wang X (2016) Learning deep feature representations with domain guided dropout for person re-identification. In: Proceedings CVPR, pp 1249–1258
Xiao T, Li S, Wang B, Lin L, Wang X (2016) End-to-end deep learning for person search. arXiv:1604.01850
Yan Y, Ni B, Song Z, Ma C, Yan Y, Yang X (2016) Person re-identification via recurrent feature aggregation. In: Proceedings ECCV, pp 701–716
Yan Y, Nie F, Li W, Gao C, Yang Y, Xu D (2016) Image classification by cross-media active learning with privileged information. IEEE Trans Multimedia 18(12):2494–2502
Yang Y, Zhuang YT, Wu F, Pan YH (2008) Harmonizing hierarchical manifolds for multimedia document semantics understanding and cross-media retrieval. IEEE Trans Multimedia 10(3):437–446
Yang Y, Nie F, Xu D, Luo J, Zhuang Y, Pan Y (2012) A multimedia retrieval framework based on semi-supervised ranking and relevance feedback. IEEE Trans Pattern Anal Mach Intell 34(4):723–742
Yang Y, Ma Z, Hauptmann AG, Sebe N (2013) Feature selection for multimedia analysis by sharing information among multiple tasks. IEEE Trans Multimedia 15(3):661–669
Yi D, Lei Z, Liao S, Li SZ (2014) Deep metric learning for person re-identification. In: Proceedings ICPR, pp 34–39
Zhang L, Xiang T, Gong S (2016) Learning a discriminative null space for person re-identification. In: Proceedings CVPR, pp 1239–1248
Zhao R, Ouyang W, Wang X (2014) Learning mid-level filters for person re-identification. In: Proceedings CVPR, pp 144–151
Zhao Y, Zhao X, Luo R, Liu Y (2016) Person re-identification by encoding free energy feature maps. Multimedia Tools Appl 75(8):4795–4813
Zheng L, Wang S, Liu Z, Tian Q (2014) Packing and padding: coupled multi-index for accurate image retrieval. In: Proceedings CVPR, pp 1939–1946
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings ICCV, pp 1116–1124
Zheng L, Wang S, Tian L, He F, Liu Z, Tian Q (2015) Query-adaptive late fusion for image search and person re-identification. In: Proceedings CVPR, pp 1741–1750
Zheng L, Bie Z, Sun Y, Wang J, Wang S, Su C, Tian Q (2016) Mars: a video benchmark for large-scale person re-identification. In: Proceedings ECCV, pp 868–884
Zheng L, Wang S, Wang J, Tian Q (2016) Accurate image search with multi-scale contextual evidences. Int J Comput Vis 120(1):1–13
Zheng L, Yang Y, Hauptmann AG (2016) Person re-identification: past, present and future. arXiv:1610.02984
Zheng L, Yang Y, Tian Q (2017) Sift meets cnn: a decade survey of instance retrieval. IEEE Trans Pattern Anal Mach Intell. doi:10.1109/TPAMI.2017.2709749
Zheng L, Zhang H, Sun S, Chandraker M, Yang Y, Tian Q (2017) Person re-identification in the wild. In: Proceedings CVPR
Zheng WS, Gong S, Xiang T (2009) Associating groups of people. In: Proceedings BMVC, pp 23.1–23.11
Zheng Z, Zheng L, Yang Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. arXiv:1701.07717

Acknowledgements

This work is supported by the Foundation for Innovative Research Groups of the NSFC (Grant no. 71421001), the National Natural Science Foundation of China (Grant no. 61502073), and the Open Projects Program of the National Laboratory of Pattern Recognition (No. 201407349).

Author information

Authors and Affiliations

School of Information and Communication Engineering, Dalian University of Technology, Dalian, 116024, China
Fuqing Zhu, Xiangwei Kong, Haiyan Fu & Ming Li
General Design Institute, Zhejiang Sci-Tech University, Hangzhou, 310018, China
Qun Wu
Taizhou Research Institute, Zhejiang University, Taizhou, 318000, China
Qun Wu

Corresponding author

Correspondence to Xiangwei Kong.

About this article

Cite this article

Zhu, F., Kong, X., Wu, Q. et al. A loss combination based deep model for person re-identification. Multimed Tools Appl 77, 3049–3069 (2018). https://doi.org/10.1007/s11042-017-5009-y

Received: 14 March 2017
Revised: 28 June 2017
Accepted: 04 July 2017
Published: 08 July 2017
Issue Date: February 2018
DOI: https://doi.org/10.1007/s11042-017-5009-y
