Abstract
Bird species classification has received more and more attention in the field of computer vision, for its promising applications in biology and environmental studies. Recognizing bird species are difficult due to the challenges of discriminative region localization and fine-grained feature learning. In this paper, we have introduced a Transfer learning based method with multistage training. We have used both Pre-Trained Mask-RCNN and a ensemble model consists of Inception Nets (InceptionV3 net & InceptionResnetV2) to get both the localization and species of the bird from the images. we have tested our model in an Indian bird dataset consist of variable size, high-resolution images are taken from camera in various environments (like day, noon, evening etc.) with different perspectives and occlusions. Our final model achieves an F1 score of 0.5567 or 55.67% on that dataset.
Code is available at: https://github.com/AKASH2907/bird-species-classification. Implemented in Keras [20].
A. Kumar and S. D. Das—Equal Contribution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fu, J., Zheng, H., Mei, T.: Look closer to see better: recurrent attention convolutional neural network for fine-grained image recognition. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, pp. 4476–4484 (2017). https://doi.org/10.1109/CVPR.2017.476
Pang, C., Yao, H., Sun, X.: Discriminative features for bird species classification. In: Proceedings of International Conference on Internet Multimedia Computing and Service, ICIMCS 2014, p. 256, 5 p. ACM, New York (2014). https://doi.org/10.1145/2632856.2632917
Ge, Z., McCool, C., Sanderson, C., Bewley, A., Chen, Z., Corke, P.: Fine-grained bird species recognition via hierarchical subset learning. In: 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, pp. 561–565 (2015). https://doi.org/10.1109/ICIP.2015.7350861
Atanbori, J., Duan, W., Murray, J., Appiah, K., Dickinson, P.: Automatic classification of flying bird species using computer vision techniques. Pattern Recogn. Lett. 81, 53–62 (2016). https://doi.org/10.1016/j.patrec.2015.08.015
Marini, A., Facon, J., Koerich, A.: Bird species classification based on color features. In: Proceedings - 2013 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2013, pp. 4336–4341 (2013). https://doi.org/10.1109/SMC.2013.740
Branson, S., et al.: Bird species categorization using pose normalized deep convolutional Nets. CoRR abs/1406.2952 (2014)
Yang, S., et al.: Unsupervised template learning for fine-grained object recognition. In: NIPS (2012)
He, K., et al.: Mask R-CNN. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2980–2988 (2017)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012)
Szegedy, C., et al.: Inception-v4, Inception-ResNet and the impact of residual connections on learning. In: AAAI (2017)
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z: Rethinking the inception architecture for computer vision. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, United States, pp. 2818–2826 (2016). https://doi.org/10.1109/CVPR.2016.308
Ramachandran, P., Zoph, B., Le, Q.V.: Swish: a self-gated activation function (2017)
Vinyals, O., Blundell, C., Lillicrap, T.P., Kavukcuoglu, K., Wierstra, D.: Matching networks for one shot learning. In: NIPS (2016)
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. CoRR, abs/1512.03385 (2015)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556 (2014)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 1, NIPS 2015, Montreal, Canada (2015)
Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39, 640–651 (2017)
Wah, C., Branson, S., Welinder, P., Perona, P., Belongie, S.: The Caltech-UCSD Birds-200-2011 dataset. Computation & Neural Systems. Technical Report, CNS-TR-2011-001 (2011)
Chollet, F., et al.: Keras (2015). https://keras.io
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumar, A., Das, S.D. (2019). Bird Species Classification Using Transfer Learning with Multistage Training. In: Arora, C., Mitra, K. (eds) Computer Vision Applications. WCVA 2018. Communications in Computer and Information Science, vol 1019. Springer, Singapore. https://doi.org/10.1007/978-981-15-1387-9_3
Download citation
DOI: https://doi.org/10.1007/978-981-15-1387-9_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1386-2
Online ISBN: 978-981-15-1387-9
eBook Packages: Computer ScienceComputer Science (R0)