Convolutional Neuronal Networks Based Monocular Object Detection and Depth Perception for Micro UAVs

  • Conference paper
Intelligence Science and Big Data Engineering (IScIDE 2018)

Abstract

In this work, we present a system for real-time object detection and depth estimation that uses the on-board camera of a micro-UAV and convolutional neural networks. Traditionally, obstacle detection has relied on visual SLAM systems. However, this level of complexity is not necessary to solve the problem, and avoiding it saves resources and execution time. By training convolutional neural networks on stereo images for depth estimation and, in the same way, training them to detect common observable objects, obstacles can be detected accurately in real time.
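
The abstract does not give implementation details, so the following is only a minimal sketch of how a predicted depth map and detected bounding boxes could be combined into per-object distances. It is not the authors' method. It assumes the standard stereo relation Z = f * B / d between disparity d (pixels), focal length f (pixels) and baseline B (metres); the function names, the box format (x_min, y_min, x_max, y_max) and the toy values are all hypothetical.

```python
import numpy as np


def disparity_to_depth(disparity, focal_px, baseline_m, eps=1e-6):
    """Convert a disparity map (pixels) to depth (metres) via Z = f * B / d.

    Disparities are clamped to `eps` to avoid division by zero.
    """
    disparity = np.asarray(disparity, dtype=float)
    return focal_px * baseline_m / np.maximum(disparity, eps)


def object_distances(depth, boxes):
    """Return the median depth inside each (x_min, y_min, x_max, y_max) box.

    The median is used here as an assumption: it is less sensitive than the
    mean to background pixels that fall inside the box.
    """
    return [float(np.median(depth[y0:y1, x0:x1])) for (x0, y0, x1, y1) in boxes]


# Toy scene: a far background (disparity 10 px) with one near object (40 px).
disparity = np.full((4, 6), 10.0)
disparity[1:3, 2:5] = 40.0

# Hypothetical camera parameters: 400 px focal length, 0.1 m baseline.
depth = disparity_to_depth(disparity, focal_px=400.0, baseline_m=0.1)

# Hypothetical detector output: one box around the near object.
dist = object_distances(depth, [(2, 1, 5, 3)])
print(dist)
```

In this toy scene the background lies at about 4 m and the boxed object at about 1 m, so a controller could threshold the per-object distance to decide when an obstacle is close enough to avoid.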

Acknowledgement

This work is part of the project "Perception and localization system for autonomous navigation of rotor micro aerial vehicle in GPS-denied environments" (VisualNavDrone, 2016-PIC-024) of the Universidad de las Fuerzas Armadas ESPE, directed by Dr. Wilbert G. Aguilar.

Author information

Corresponding author

Correspondence to Wilbert G. Aguilar.

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Aguilar, W.G., Quisaguano, F.J., Rodríguez, G.A., Alvarez, L.G., Limaico, A., Sandoval, D.S. (2018). Convolutional Neuronal Networks Based Monocular Object Detection and Depth Perception for Micro UAVs. In: Peng, Y., Yu, K., Lu, J., Jiang, X. (eds) Intelligence Science and Big Data Engineering. IScIDE 2018. Lecture Notes in Computer Science, vol 11266. Springer, Cham. https://doi.org/10.1007/978-3-030-02698-1_35

  • DOI: https://doi.org/10.1007/978-3-030-02698-1_35

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-02697-4

  • Online ISBN: 978-3-030-02698-1

  • eBook Packages: Computer Science (R0)
