Automatic Object Detection from Digital Images by Deep Learning with Transfer Learning

Yabuki, Nobuyoshi; Nishimura, Naoto; Fukuda, Tomohiro

doi:10.1007/978-3-319-91635-4_1

Conference paper
First Online: 19 May 2018

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 10863)

Abstract

Engineers take an enormous number of digital photographs at construction sites and in disaster areas. Collecting, sorting, annotating, storing, deleting, and distributing these images by hand is cumbersome, error-prone, and time-consuming. It is therefore desirable to automate object detection in these photographs, which would save engineers' time and improve efficiency and accuracy. Conventional machine learning could be a solution, but researchers must spend considerable time determining the features and contents of the images, and the accuracy tends to be unsatisfactory. Deep learning, by contrast, can determine the features and contents of various objects from digital images automatically. This research therefore aims to use deep learning to automatically detect each object in a digital image together with its position. Because deep learning usually requires a very large dataset, the research adopts deep learning with transfer learning, which enables object detection even when the dataset is not very large. Experiments on detecting construction machines, workers, and signboards in photographs compared three approaches: conventional machine learning based on feature values, deep learning without transfer learning, and deep learning with transfer learning. Deep learning with transfer learning achieved the best performance.
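The idea of transfer learning mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' network or training procedure: it is a pure-Python stand-in in which a fixed, hypothetical "pretrained" feature layer is kept frozen and only a small new classification head is trained on a handful of labeled examples. The names `FROZEN_WEIGHTS`, `DATA`, `extract_features`, and `train_head`, as well as the weights and data points, are assumptions made up for illustration.

```python
import math

# Hypothetical "pretrained" weights, standing in for layers learned on a large
# source dataset. They stay frozen: nothing below ever modifies them.
FROZEN_WEIGHTS = ((1.0, -1.0), (0.5, 0.5))

# Small labeled target dataset (made up for illustration): label 1 if x0 > x1.
DATA = [
    ((2.0, 1.0), 1), ((1.0, 2.0), 0), ((0.5, -0.5), 1), ((-1.0, 0.0), 0),
    ((3.0, 2.5), 1), ((-2.0, -1.0), 0), ((0.0, -1.0), 1), ((1.0, 1.5), 0),
]

def extract_features(x):
    """Frozen feature extractor: tanh of a fixed linear map of the input."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in FROZEN_WEIGHTS]

def predict(x, head_w, head_b):
    """New task-specific head: logistic regression on the frozen features."""
    z = sum(w * f for w, f in zip(head_w, extract_features(x))) + head_b
    return 1.0 / (1.0 + math.exp(-z))

def mean_log_loss(head_w, head_b):
    """Average binary cross-entropy of the head over the small dataset."""
    eps = 1e-12
    total = 0.0
    for x, y in DATA:
        p = min(max(predict(x, head_w, head_b), eps), 1.0 - eps)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(DATA)

def train_head(epochs=200, lr=0.5):
    """Train only the head by stochastic gradient descent on the log loss."""
    head_w, head_b = [0.0] * len(FROZEN_WEIGHTS), 0.0
    for _ in range(epochs):
        for x, y in DATA:
            feats = extract_features(x)
            err = predict(x, head_w, head_b) - y  # d(loss)/d(logit)
            head_w = [w - lr * err * f for w, f in zip(head_w, feats)]
            head_b -= lr * err
    return head_w, head_b

loss_before = mean_log_loss([0.0, 0.0], 0.0)
head_w, head_b = train_head()
loss_after = mean_log_loss(head_w, head_b)
accuracy = sum((predict(x, head_w, head_b) > 0.5) == bool(y) for x, y in DATA) / len(DATA)
print(f"loss before: {loss_before:.3f}, after: {loss_after:.3f}, accuracy: {accuracy:.2f}")
```

Because only the few head parameters are fitted, a small dataset can suffice, which is the property the abstract attributes to transfer learning. A real detector would replace the frozen layer with a pretrained convolutional network and the head with a detection head that also predicts object positions.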

Author information

Authors and Affiliations

Osaka University, Suita, Osaka, 565-0871, Japan
Nobuyoshi Yabuki, Naoto Nishimura & Tomohiro Fukuda

Corresponding author

Correspondence to Nobuyoshi Yabuki.

Editor information

Editors and Affiliations

Applied Computing and Mechanics Laboratory (IMAC), School of Architecture, Civil and Environmental Engineering (ENAC), Swiss Federal Institute of Technology, Lausanne (EPFL), Lausanne, Switzerland
Ian F. C. Smith
Institute for Landscape, Architecture, Construction and Territory (inPact) Construction and Environment Department (CED), University of Applied Sciences, Geneva (HEPIA), Geneva, Switzerland
Bernd Domer

Cite this paper

Yabuki, N., Nishimura, N., Fukuda, T. (2018). Automatic Object Detection from Digital Images by Deep Learning with Transfer Learning. In: Smith, I., Domer, B. (eds) Advanced Computing Strategies for Engineering. EG-ICE 2018. Lecture Notes in Computer Science, vol 10863. Springer, Cham. https://doi.org/10.1007/978-3-319-91635-4_1

DOI: https://doi.org/10.1007/978-3-319-91635-4_1
Published: 19 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91634-7
Online ISBN: 978-3-319-91635-4
eBook Packages: Computer Science, Computer Science (R0)