Abstract
Still to video (S2V) face recognition attracts many interests for researchers in computer vision and biometrics. In S2V scenarios, the still images are often captured with high quality and cooperative user condition. On the contrary, video clips usually show more variations and of low quality. In this paper, we primarily focus on the S2V face recognition where face gallery is formed by a few still face images, and the query is the video clip. We utilized the deep convolutional neural network to deal with the S2V face recognition. We also studied the choice of different similarity measures for the face matching, and suggest the more appropriate measure for the deep representations. Our results for both S2V face identification and verification yield a significant improvement over the previous results on two databases, i.e., COX-S2V and PaSC.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Beveridge, J.R., Phillips, P.J., Bolme, D.S., Draper, B.A., Givens, G.H., Lui, Y.M., Teli, M.N., Zhang, H., Scruggs, W.T., Bowyer, K.W., et al.: The challenge of face recognition from digital point-and-shoot cameras. In: IEEE Biometrics: Theory, Applications and Systems (BTAS), pp. 1–8 (2013)
Beveridge, J.R., Zhang, H., Draper, B.A., Flynn, P.J., Feng, Z., Huber, P., Kittler, J., Huang, Z., Li, S., Li, Y., et al.: Report on the fg 2015 video person recognition evaluation. In: 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG), vol. 1, pp. 1–8. IEEE (2015)
Chen, X., Wang, C., Xiao, B., Cai, X.: Scenario oriented discriminant analysis for still-to-video face recognition. In: IEEE International Conference on Image Processing (ICIP), pp. 738–742 (2014)
Chen, X., Wang, C., Xiao, B., Zhang, C.: Still-to-video face recognition via weighted scenario oriented discriminant analysis. In: IEEE International Joint Conference on Biometrics (IJCB), pp. 1–6 (2014)
Cui, Z., Chang, H., Shan, S., Ma, B., Chen, X.: Joint sparse representation for video-based face recognition. Neurocomputing 135, 306–312 (2014)
Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: Proceedings of the 24th International Conference on Machine Learning, pp. 209–216. ACM (2007)
Goldberger, J., Hinton, G.E., Roweis, S.T., Salakhutdinov, R.R.: Neighbourhood components analysis. In: Advances in Neural Information Processing Systems, pp. 513–520 (2005)
Gong, B., Shi, Y., Sha, F., Grauman, K.: Geodesic flow kernel for unsupervised domain adaptation. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2066–2073 (2012)
Huang, Z., Shan, S., Zhang, H., Lao, S., Kuerban, A., Chen, X.: Benchmarking still-to-video face recognition via partial and local linear discriminant analysis on COX-S2V dataset. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds.) ACCV 2012. LNCS, vol. 7725, pp. 589–600. Springer, Heidelberg (2013). doi:10.1007/978-3-642-37444-9_46
Huang, Z., Wang, R., Shan, S., Chen, X.: Learning euclidean-to-riemannian metric for point-to-set classification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1677–1684 (2014)
Kim, T.-K., Kittler, J., Cipolla, R.: Discriminative learning and recognition of image set classes using canonical correlations. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 1005–1018 (2007)
Liu, X., Chen, T.: Video-based face recognition using adaptive hidden markov models. In: IEEE Computer Vision and Pattern Recognition, vol. 1, p. I-340 (2003)
Sugiyama, M.: Dimensionality reduction of multimodal labeled data by local fisher discriminant analysis. J. Mach. Learn. Res. 8, 1027–1061 (2007)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
Wang, H., Liu, C., Ding, X.: Still-to-video face recognition in unconstrained environments. In: IS&T/SPIE Electronic Imaging, p. 94050O. International Society for Optics and Photonics (2015)
Wang, R., Guo, H., Davis, L.S., Dai, Q., Covariance discriminative learning: a natural and efficient approach to image set classification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2496–2503 (2012)
Weinberger, K.Q., Blitzer, J., Saul, L.K.: Distance metric learning for large margin nearest neighbor classification. In: Advances in Neural Information Processing Systems, pp. 1473–1480 (2005)
Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch. arXiv preprint arXiv:1411.7923 (2014)
Zhu, P., Zhang, L., Zuo, W., Zhang, D.: From point to set: extend the learning of distance metrics. In: IEEE International Conference on Computer Vision, pp. 2664–2671 (2013)
Zhu, Y., Zheng, Z., Li, Y., Mu, G., Shan, S., Guo, G.: Still to video face recognition using a heterogeneous matching approach. In: IEEE Biometrics: Theory, Applications and Systems (BTAS) (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Zhu, Y., Guo, G. (2016). Exploring Deep Features with Different Distance Measures for Still to Video Face Matching. In: You, Z., et al. Biometric Recognition. CCBR 2016. Lecture Notes in Computer Science(), vol 9967. Springer, Cham. https://doi.org/10.1007/978-3-319-46654-5_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-46654-5_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46653-8
Online ISBN: 978-3-319-46654-5
eBook Packages: Computer ScienceComputer Science (R0)