Convolutional Neuronal Networks Based Monocular Object Detection and Depth Perception for Micro UAVs

Aguilar, Wilbert G.; Quisaguano, Fernando J.; Rodríguez, Guillermo A.; Alvarez, Leandro G.; Limaico, Alex; Sandoval, David S.

doi:10.1007/978-3-030-02698-1_35

Wilbert G. Aguilar^17,18,19,
Fernando J. Quisaguano¹⁷,
Guillermo A. Rodríguez¹⁷,
Leandro G. Alvarez¹⁷,
Alex Limaico¹⁷ &
…
David S. Sandoval¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11266))

Included in the following conference series:

International Conference on Intelligent Science and Big Data Engineering

1893 Accesses
5 Citations
3 Altmetric

Abstract

In this work, we present the development of a system for the detection and depth estimation of objects in real time using the on-board camera in a micro-UAV through convolutional neuronal networks. Traditionally for the detection of obstacles shows the use of SLAM visual systems. However, to solve this problem, this level of complexity is not necessary, saving resources and execution time. The training with convolutional neural networks using stereo images for the depth estimation and in the same way training the detection of common observable objects can obtain an accurate detection of obstacles in a real time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Aguilar, W.G., Angulo, C.: Real-time model-based video stabilization for microaerial vehicles. Neural Process. Lett. 43(2), 459–477 (2016)
Article Google Scholar
Aguilar, W.G., Angulo, C.: Real-time video stabilization without phantom movements for micro aerial vehicles. EURASIP J. Image Video Process. 2014(1), 46 (2014)
Article Google Scholar
Gageik, N., Benz, P., Montenegro, S.: Obstacle detection and collision avoidance for a UAV with complementary low-cost sensors. IEEE Access 3, 599–609 (2015)
Article Google Scholar
Yang, S., Konam, S., Ma, C., Rosenthal, S., Veloso, M., Scherer, S.: Obstacle avoidance through deep networks based intermediate perception (2017)
Google Scholar
Aguilar, W.G., et al.: Pedestrian detection for UAVs using cascade classifiers and saliency maps. In: Rojas, I., Joya, G., Catala, A. (eds.) IWANN 2017. LNCS, vol. 10306, pp. 563–574. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59147-6_48
Chapter Google Scholar
Aguilar, W.G., et al.: Cascade classifiers and saliency maps based people detection. In: De Paolis, L.T., Bourdot, P., Mongelli, A. (eds.) AVR 2017. LNCS, vol. 10325, pp. 501–510. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-60928-7_42
Chapter Google Scholar
Aguilar, W.G., Rodríguez, G.A., Álvarez, L., Sandoval, S., Quisaguano, F., Limaico, A.: Visual SLAM with a RGB-D camera on a quadrotor UAV using on-board processing. In: Rojas, I., Joya, G., Catala, A. (eds.) IWANN 2017. LNCS, vol. 10306, pp. 596–606. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59147-6_51
Chapter Google Scholar
Aguilar, W.G., Rodríguez, G.A., Álvarez, L., Sandoval, S., Quisaguano, F., Limaico, A.: Real-time 3D modeling with a RGB-D camera and on-board processing. In: De Paolis, L.T., Bourdot, P., Mongelli, A. (eds.) AVR 2017. LNCS, vol. 10325, pp. 410–419. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-60928-7_35
Chapter Google Scholar
Aguilar, W.G., Rodríguez, G.A., Álvarez, L., Sandoval, S., Quisaguano, F., Limaico, A.: On-board visual SLAM on a UGV using a RGB-D camera. In: Huang, Y.A., Wu, H., Liu, H., Yin, Z. (eds.) ICIRA 2017. LNCS (LNAI), vol. 10464, pp. 298–308. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-65298-6_28
Chapter Google Scholar
Oleynikova, H., Honegger, D., Pollefeys, M.: Reactive avoidance using embedded stereo vision for MAV flight. In: Proceedings of IEEE International Conference on Robotics and Automation, vol. 2015, pp. 50–56 (2015)
Google Scholar
Eigen, D., Puhrsch, C., Fergus, R.: Depth map prediction from a single image using a multi-scale deep network. In: NIPS, pp. 1–9 (2014)
Google Scholar
Liu, F., Shen, C., Lin, G., Reid, I.: Learning depth from single monocular images using deep convolutional neural fields. IEEE Trans. Pattern Anal. Mach. Intell. 38(10), 2024–2039 (2016)
Article Google Scholar
Chakravarty, P., Kelchtermans, K., Roussel, T., Wellens, S., Tuytelaars, T., Van Eycken, L.: CNN-based single image obstacle avoidance on a quadrotor. In: Proceedings of IEEE International Conference on Robotics and Automation (ICRA), pp. 6369–6374 (2017)
Google Scholar
Aguilar, W.G., Quisaguano, F., Álvarez, L., Pardo, J., Proaño, Z.: Monocular depth perception on a micro-UAV using convolutional neuronal networks. Accepted
Google Scholar
Godard, C., Mac Aodha, O., Brostow, G.J.: Unsupervised monocular depth estimation with left-right consistency (2016)
Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS, pp. 91–99 (2015)
Google Scholar
Dai, J., Li, Y., He, K., Sun, J.: R-FCN: object detection via region-based fully convolutional networks. In: NIPS (2016)
Google Scholar
Liu, W., et al.: SSD: single shot multibox detector, pp. 21–37 (2016)
Chapter Google Scholar
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection (2015)
Google Scholar
Cvpr, A., Id, P.: Speed/accuracy trade-offs for modern convolutional object detectors. In: CVPR, vol. 3562, pp. 7310–7319 (2017)
Google Scholar
Yilmaz, A., Javed, O., Shah, M.: Object tracking. ACM Comput. Surv. 38(4), 13 (2006)
Article Google Scholar
Sebastian, S., Mori, T.: First results in detecting and avoiding frontal obstacle from monocular camera for micro unmanned aerial vehicles. In: 2013 IEEE International Conference on Robotics and automation (ICRA), vol. 53, no. 9, pp. 1689–1699 (2013)
Google Scholar
Aguilar, W.G., Morales, S.G.: 3D environment mapping using the Kinect V2 and path planning based on RRT algorithms. Electronics 5(4), 70 (2016)
Article Google Scholar
Aguilar, W.G., Morales, S., Ruiz, H., Abad, V.: RRT* GL based optimal path planning for real-time navigation of UAVs. In: Rojas, I., Joya, G., Catala, A. (eds.) IWANN 2017. LNCS, vol. 10306, pp. 585–595. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59147-6_50
Chapter Google Scholar
Tokui, S., Oono, K., Hido, S., Clayton, J.: Chainer: a next-generation open source framework for deep learning. In: Proceedings of Workshop on Machine Learning Systems, Twenty-Ninth Annual Conference on Neural Information Processing Systems, pp. 1–6 (2015)
Google Scholar
Saxena, A., Chung, S.H., Ng, A.Y.: Learning depth from single monocular images. Adv. Neural. Inf. Process. Syst. 18, 1161–1168 (2006)
Google Scholar
Mancini, M., Costante, G., Valigi, P., Ciarfuglia, T.A., Delmerico, J., Scaramuzza, D.: Toward domain independence for learning-based monocular depth estimation. IEEE Robot. Autom. Lett. 2(3), 1778–1785 (2017)
Article Google Scholar
Mancini, M., Costante, G., Valigi, P., Ciarfuglia, T.A.: J-MOD²: joint monocular obstacle detection and depth estimation, vol. 3766, no. c, pp. 1–8 (2017)
Google Scholar
Lecumberry, F.: Cálculo de disparidad en imágenes estéreo, una comparación. XI Congr. Argentino Ciencias la Comput (2005)
Google Scholar
Szegedy, C., Reed, S., Erhan, D., Anguelov, D., Ioffe, S.: Scalable, high-quality object detection (2014)
Google Scholar
Aguilar, W.G., Salcedo, V.S., Sandoval, D.S., Cobeña, B.: Developing of a video-based model for UAV autonomous navigation. In: Barone, D.A.C., Teles, E.O., Brackmann, C.P. (eds.) LAWCN 2017. CCIS, vol. 720, pp. 94–105. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-71011-2_8
Chapter Google Scholar
Aguilar, W.G., Casaliglla, V.P., Pólit, J.L.: Obstacle avoidance based-visual navigation for micro aerial vehicles. Electronics 6(1), 10 (2017)
Article Google Scholar
Aguilar, W.G., Casaliglla, V.P., Pólit, J.L., Abad, V., Ruiz, H.: Obstacle avoidance for flight safety on unmanned aerial vehicles. In: Rojas, I., Joya, G., Catala, A. (eds.) IWANN 2017. LNCS, vol. 10306, pp. 575–584. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59147-6_49
Chapter Google Scholar
Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? The KITTI vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition, pp. 3354–3361 (2012)
Google Scholar
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Chapter Google Scholar

Download references

Acknowledgement

This work is part of the project Perception and localization system for autonomous navigation of rotor micro aerial vehicle in gps-denied environments, VisualNavDrone, 2016-PIC-024, from the Universidad de las Fuerzas Armadas ESPE, directed by Dr. Wilbert G. Aguilar.

Author information

Authors and Affiliations

CICTE Research Center, Universidad de las Fuerzas Armadas ESPE, Sangolquí, Ecuador
Wilbert G. Aguilar, Fernando J. Quisaguano, Guillermo A. Rodríguez, Leandro G. Alvarez, Alex Limaico & David S. Sandoval
FIS Faculty, Escuela Politécnica Nacional, Quito, Ecuador
Wilbert G. Aguilar
GREC Research Group, Universitat Politècnica de Catalunya, Barcelona, Spain
Wilbert G. Aguilar

Authors

Wilbert G. Aguilar
View author publications
You can also search for this author in PubMed Google Scholar
Fernando J. Quisaguano
View author publications
You can also search for this author in PubMed Google Scholar
Guillermo A. Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Leandro G. Alvarez
View author publications
You can also search for this author in PubMed Google Scholar
Alex Limaico
View author publications
You can also search for this author in PubMed Google Scholar
David S. Sandoval
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wilbert G. Aguilar .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Yuxin Peng
Shanghai Jiao Tong University, Shanghai, China
Kai Yu
Tsinghua University, Beijing, China
Jiwen Lu
Central China Normal University, Wuhan, China
Xingpeng Jiang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Aguilar, W.G., Quisaguano, F.J., Rodríguez, G.A., Alvarez, L.G., Limaico, A., Sandoval, D.S. (2018). Convolutional Neuronal Networks Based Monocular Object Detection and Depth Perception for Micro UAVs. In: Peng, Y., Yu, K., Lu, J., Jiang, X. (eds) Intelligence Science and Big Data Engineering. IScIDE 2018. Lecture Notes in Computer Science(), vol 11266. Springer, Cham. https://doi.org/10.1007/978-3-030-02698-1_35

Download citation

DOI: https://doi.org/10.1007/978-3-030-02698-1_35
Published: 09 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02697-4
Online ISBN: 978-3-030-02698-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics