Interpretable Verification of Visually Similar Vehicle Images Using Convolutional Networks

Hong, Liugen; Wang, Wenzhong; Pang, Yanhui; Hu, Huai; Tang, Jin

doi:10.1007/978-981-13-2922-7_13

Interpretable Verification of Visually Similar Vehicle Images Using Convolutional Networks

Liugen Hong¹³,
Wenzhong Wang¹³,
Yanhui Pang¹³,
Huai Hu¹³ &
…
Jin Tang¹³

Conference paper
First Online: 11 October 2018

1865 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 945))

Abstract

This paper presents a simple and effective method to verify similar vehicle images. In order to provide a meaningful interpretation of the verification, we propose to detect the local differences between two images. We frame this task as a saliency map regression problem, where the saliency map measures the degree of discrepancy at every pixel. To achieve this goal, we use a convolutional neural network (CNN) to map two aligned vehicle images to one saliency map. Our network design enables end-to-end training. We validate our algorithm on a vehicle image dataset. Experimental results show that our approach is accurate, fast and robust, and it achieves better performance than other methods.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Bhattarai, B., Sharma, G., Jurie, F.: CP-mtML: coupled projection multi-task metric learning for large scale face retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4226–4235 (2016)
Google Scholar
Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Article MathSciNet Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Kumar, B., Carneiro, G., Reid, I., et al.: Learning local image descriptors with deep siamese and triplet convolutional networks by minimising global loss functions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5385–5394 (2016)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article MathSciNet Google Scholar
Noh, H., Hong, S., Han, B.: Learning deconvolution network for semantic segmentation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1520–1528 (2015)
Google Scholar
Plamondon, R., Lorette, G.: Automatic signature verification and writer identification-the state of the art. Pattern Recognit. 22(2), 107–131 (1989)
Article Google Scholar
Qin, H., Yan, J., Li, X., Hu, X.: Joint training of cascaded CNN for face detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3456–3465 (2016)
Google Scholar
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)
Google Scholar
Simonyan, K., Vedaldi, A., Zisserman, A.: Deep inside convolutional networks: visualising image classification models and saliency maps. arXiv preprint arXiv:1312.6034 (2013)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Sun, Y., Chen, Y., Wang, X., Tang, X.: Deep learning face representation by joint identification-verification. In: Advances in Neural Information Processing Systems, pp. 1988–1996 (2014)
Google Scholar
Sun, Y., Wang, X., Tang, X.: Deeply learned face representations are sparse, selective, and robust. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2892–2900 (2015)
Google Scholar
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: DeepFace: closing the gap to human-level performance in face verification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1701–1708 (2014)
Google Scholar
Tan, Z., Liu, B., Yu, N.: PPEDNet: pyramid pooling encoder-decoder network for real-time semantic segmentation. In: Zhao, Y., Kong, X., Taubman, D. (eds.) ICIG 2017. LNCS, vol. 10666, pp. 328–339. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-71607-7_29
Chapter Google Scholar
Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch. arXiv preprint arXiv:1411.7923 (2014)
Zagoruyko, S., Komodakis, N.: Learning to compare image patches via convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4353–4361 (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Technology, Anhui University, Hefei, 230601, China
Liugen Hong, Wenzhong Wang, Yanhui Pang, Huai Hu & Jin Tang

Authors

Liugen Hong
View author publications
You can also search for this author in PubMed Google Scholar
Wenzhong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yanhui Pang
View author publications
You can also search for this author in PubMed Google Scholar
Huai Hu
View author publications
You can also search for this author in PubMed Google Scholar
Jin Tang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jin Tang .

Editor information

Editors and Affiliations

School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, China
Zongben Xu
Xidian University, Xi'an, China
Xinbo Gao
Xidian University, Xi'an, Shaanxi, China
Qiguang Miao
Chinese Academy of Sciences, Beijing, China
Yunquan Zhang
Zhejiang University, Hangzhou, China
Jiajun Bu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hong, L., Wang, W., Pang, Y., Hu, H., Tang, J. (2018). Interpretable Verification of Visually Similar Vehicle Images Using Convolutional Networks. In: Xu, Z., Gao, X., Miao, Q., Zhang, Y., Bu, J. (eds) Big Data. Big Data 2018. Communications in Computer and Information Science, vol 945. Springer, Singapore. https://doi.org/10.1007/978-981-13-2922-7_13

Download citation

DOI: https://doi.org/10.1007/978-981-13-2922-7_13
Published: 11 October 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-2921-0
Online ISBN: 978-981-13-2922-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the China Computer Federation (CCF) (opens in a new tab)