Abstract
Detecting small targets like vehicles in high resolution satellite images is a significant but challenging task. In the past decade, some detection frameworks have been proposed to solve this problem. However, like the traditional ways of object detection in natural images those methods all consist of multiple separated stages. Region proposals are first produced, then, fed into the feature extractor and classified finally. Multi-stage detection schemes are designed complicated and time-consuming. In this paper, we propose a unified single-stage vehicle detection framework using fully convolutional network (FCN) to simultaneously predict vehicle bounding boxes and class probabilities from an arbitrary-sized satellite image. We elaborate our FCN architecture which replaces the fully connected layers in traditional CNNs with convolutional layers and design vehicle object-oriented training methodology with reference boxes (anchors). The whole model can be trained end-to-end by minimizing a multi-task loss function. Comparison experiment results on a common dataset demonstrate that our FCN model which has much fewer parameters can achieve a faster detection with lower false alarm rates compared to the traditional methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhao, T., Nevatia, R.: Car detection in low resolution aerial images. Image Vis. Comput. 2, 693–703 (2003)
Eikvil, L., Aurdal, L., Koren, H.: Classification-based vehicle detection in high-resolution satellite images. ISPRS J. Photogram. Remote Sens. 64, 65–72 (2009)
Liang, P., Teodoro, G., Ling, H., Blasch, E., Chen, G., Bai, L.: Multiple kernel learning for vehicle detection in wide area motion imagery. In: Proceedings of the 15th International Conference on Information Fusion, pp. 1629–1636 (2012)
Kembhavi, A., Harwood, D., Davis, L.S.: Vehicle detection using partial least squares. IEEE Trans. Pattern Anal. Mach. Intell. 33, 1250–1265 (2011)
Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25 (2012)
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., Berg, A.C., Fei-Fei, L.: Imagenet large scale visual recognition challenge. Int. J. Comput. Vis. 115, 211–252 (2015)
Jiang, Q., Cao, L., Cheng, M., Wang, C., Li, J.: Deep neural networks-based vehicle detection in satellite images. In: International Symposium on Bioelectronics and Bioinformatics, pp. 184–187 (2015)
Chen, X., Xiang, S., Liu, C., Pan, C.: Vehicle detection in satellite images by hybrid deep convolutional neural networks. IEEE Geosci. Remote Sens. Lett. 11, 1797–1801 (2014)
Chen, X., Xiang, S., Liu, C., Pan, C.: Vehicle detection in satellite images by parallel deep convolutional neural networks. In: Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, vol. 14, pp. 181–185 (2013)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 91–99 (2015)
Szegedy, C., Reed, S., Erhan, D., Anguelov, D.: Scalable, high-quality object detection. arXiv:1412.1441 (v1) (2015)
Pinheiro, P., Collobert, R., Dollar, P.: Learning to segment object candidates. In: NIPS (2015)
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10590-1_53
LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1, 541–551 (1989)
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 675–678 (2014)
Dalal, D., Triggs, B.: Histogram of oriented gradients for human detection. Proc. IEEE Conf. Comput. Vis. Pattern Recogn. 1, 886–893 (2005)
Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24, 971–987 (2002)
Viola, P., Jones, M.J.: Robust real-time face detection. Int. J. Comput. Vis. 57, 137–154 (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Hu, J., Xu, T., Zhang, J., Yang, Y. (2016). Fast Vehicle Detection in Satellite Images Using Fully Convolutional Network. In: Zhang, Z., Huang, K. (eds) Intelligent Visual Surveillance. IVS 2016. Communications in Computer and Information Science, vol 664. Springer, Singapore. https://doi.org/10.1007/978-981-10-3476-3_15
Download citation
DOI: https://doi.org/10.1007/978-981-10-3476-3_15
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3475-6
Online ISBN: 978-981-10-3476-3
eBook Packages: Computer ScienceComputer Science (R0)