Abstract
The wide applications of the object detection techniques in the domains like video surveillance, security, military, automated industry tasks, biometrics has attracted the interest of the researchers. Deep learning is one of the most effective and efficient techniques for the object detection nowadays and has brought quite a revolution in this field. This paper proposes CNN architecture for the extraction of compact binary descriptors using stacked convolutional auto encoders without labeled data. PASCAL and CALTECH standard object datasets are used to validate the experimental results. The results are presented in terms of recall and precision performance matrices. The results show that the proposed architecture using CNN outperforms the rest of the state-of the art descriptor of its class. The recall and precision for the CALTECH dataset for the proposed CNN architecture is 0.98 and 0.93 respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kumar, R., Chandra, P., Hanmandlu, M.: A robust fingerprint matching system using orientation features. J. Inf. Process. Syst. 121, 83–99 (2016)
Kumar, R., Hanmandlu, M., Chandra, P.: An empirical evaluation of rotation invariance of LDP features for fingerprint matching using neural networks. Int. J. Comput. Vis. Robot. 4(4), 330–348 (2014)
Kumar, R., Chandra, P., Hanmandlu, M.: Rotational invariant fingerprint matching using local directional descriptors. Int. J. Comput. Intell. Stud. 3(4), 292–319 (2014)
Kumar, R.: Fingerprint matching using rotational invariant orientation local binary pattern descriptor and machine learning techniques. Int. J. Comput. Vis. Image Process. (IJCVIP) 7(4), 51–67 (2017)
Kumar, R.: Hand image biometric based personal authentication system. In: Intelligent Techniques in Signal Processing for Multimedia Security, pp. 201–226. Springer, Cham (2017)
Rani, R., Kumar, R., Singh, A.P.: An empirical evaluation of translational and rotational invariance of descriptors and the classification of flower dataset. Pattern Anal. Appl. 21(1), 1–18 (2018)
Rani, R., Kumar, R., Singh, A.P.: A comparative study of object recognition techniques. In: 2016 7th International Conference on Intelligent Systems, Modelling and Simulation (ISMS), pp. 151–156. IEEE, January 2016
Rani, R., Kumar, R., Singh, A.P.: An empirical evaluation of local descriptors in object recognition. In: 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 1517–1521. IEEE, September 2016
Rani, R., Singh, A.P., Kumar, R.: Impact of reduction in descriptor size on object detection and classification. Multimed. Tools Appl. 1–15 (2018)
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: Proceedings of IEEE International Conference on Computer Vision, pp. 2564–2571 (2011)
Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retina keypoint. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2012)
Leutenegger, S., Chli, M., Siegwart, R.: BRISK: binary robust invariant scalable keypoints. In: Proceedings of IEEE International Conference on Computer Vision (ICCV), pp. 2548 –2555, November 2011
Calonder, M., Lepetit, V., Ozuysal, M., Trzcinski, T., Strecha, C., Fua, P.: BRIEF: computing a local binary descriptor very fast. IEEE Trans. Pattern Anal. Mach. Intell. 34, 1281–1298 (2012)
Levi, G., Hassner, T.: LATCH: learned arrangements of three patch codes. In: IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, NY, USA, March 2016
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)
Bay, H., Ess, A., Tuytelaars, T., Gool, L.V.: SURF – speeded up robust features. Comput. Vis. Image Underst. 110, 346–359 (2008)
Ke, Y., Sukthankar, R.: PCA-SIFT: a more distinctive representation for local image descriptors. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 506–513 (2004)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS, pp. 1106–1114 (2012)
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: ECCV (2014)
Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: DeCAF: a deep convolutional activation feature for generic visual recognition. In: ICML (2014)
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: CVPR (2015)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR (2014)
Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., LeCun, Y.: OverFeat: integrated recognition, localization and detection using convolutional networks. In: ICLR (2014)
Dosovitskiy, A., Springenberg, J.T., Brox, T.: Unsupervised feature learning by augmenting single images. Pre-print, arXiv:1312.5242v3 [cs.CV] (2014). ICLR 2014 workshop track 2
Yu, K., Xu, W., Gong, Y.: Deep learning with kernel regularization for visual recognition. In NIPS, pp. 1889–1896 (2008)
Ahmed, A., Yu, K., Xu, W., Gong, Y., Xing, E.: Training hierarchical feed-forward visual recognition models using transfer learning from pseudo-tasks. In: ECCV, pp. 69–82 (2008)
LeCun, Y., Huang, F.-J., Bottou, L.: Learning methods for generic object recognition with invariance to pose and lighting. In: CVPR (2004)
Ranzato, M., Huang, F.-J., Boureau, Y., LeCun, Y.: Unsupervised learning of invariant feature hierarchies with applications to object recognition. In: CVPR (2007)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998). CVPR 2007
Hinton, G.E., Osindero, S., Teh, Y.: A fast learning algorithm for deep belief nets. Neural Comput. 18, 1527–1554 (2006)
Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: De- CAF: a deep convolutional activation feature for generic visual recognition. Pre-print arXiv:1310.1531v1 [cs.CV] 1 (2013)
Zhang, Y., Lee, K., Lee, H.: Augmenting supervised neural networks with unsupervised objectives for large-scale image classification. arXiv (2016)
Sharma, S.K., Chandra, P.: Constructive neural networks: a review. Int. J. Eng. Sci. Technol. 2(12), 7847–7855 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Rani, R., Kumar, R., Singh, A.P. (2020). Deep Learning Method Based Binary Descriptor for Object Detection. In: Singh, P., Panigrahi, B., Suryadevara, N., Sharma, S., Singh, A. (eds) Proceedings of ICETIT 2019. Lecture Notes in Electrical Engineering, vol 605. Springer, Cham. https://doi.org/10.1007/978-3-030-30577-2_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-30577-2_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30576-5
Online ISBN: 978-3-030-30577-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)