Revisiting Deep Convolutional Neural Networks for RGB-D Based Object Recognition

Madai-Tahy, Lorand; Otte, Sebastian; Hanten, Richard; Zell, Andreas

doi:10.1007/978-3-319-44781-0_4

Lorand Madai-Tahy¹⁶,
Sebastian Otte¹⁶,
Richard Hanten¹⁶ &
…
Andreas Zell¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9887))

Included in the following conference series:

International Conference on Artificial Neural Networks

4119 Accesses
6 Citations

Abstract

In this paper we reinvestigate Deep Convolutional Neural Networks (DCNNs) for RGB-D based object recognition. A previously proposed method in which DCNNs are pretrained on a large-scale RGB database and just fine-tuned to process colorized depth images is taken up and extended. We introduce and analyse multiple solutions to improve depth colorization and propose a new method for depth colorization based on surface normals. We show that our improvements increase the classification accuracy significantly, such that we can present new state-of-the-art results for the Washington RGB-D dataset. Our results also indicate that classification using only surface normals without RGB images outperforms classification using pure RGB images, which is to our knowledge a novel discovery in the field of DCNNs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Asif, U., Bennamoun, M., Sohel, F.: Efficient RGB-D object categorization using cascaded ensembles of randomized decision trees. In: IEEE International Conference on Robotics and Automation, ICRA 2015, pp. 1295–1302 (2015)
Google Scholar
Blum, M., Springenberg, J., Wuelfing, J., Riedmiller, M.: A learned feature descriptor for object recognition in RGB-D data. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pp. 1298–1303 (2012)
Google Scholar
Bo, L., Lai, K., Ren, X., Fox, D.: Object recognition with hierarchical kernel descriptors. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1729–1736 (2011)
Google Scholar
Bo, L., Ren, X., Fox, D.: Depth kernel descriptors for object recognition. In: Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 821–826 (2011)
Google Scholar
Bo, L., Ren, X., Fox, D.: Unsupervised feature learning for RGB-D based object recognition. In: Proceedings of the International Symposium on Experimental Robtics (ISER), pp. 387–402 (2012)
Google Scholar
Deng, J., Dong, W., Socher, R., Li, L., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 248–255 (2009)
Google Scholar
Donahue, J.: Caffenet. https://github.com/BVLC/caffe/tree/master/models/bvlc_reference_caffenet. Accessed 26 Feb 2016
Eitel, A., Springenberg, J.T., Spinello, L., Riedmiller, M., Burgaard, W.: Multimodal deep learning for robust RGB-D object recognition. In: IROS Conference, pp. 681–687 (2015)
Google Scholar
Gupta, S., Girshick, R., Arbeláez, P., Malik, J.: Learning rich features from RGB-D images for object detection and segmentation. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part VII. LNCS, vol. 8695, pp. 345–360. Springer, Heidelberg (2014)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25, 1106–1114 (2012)
Google Scholar
Lai, K., Bo, L., Ren, X., Fox, D.: A large-scale hierarchical multi-view RGB-D object dataset. In: Proceedings of the IEEE International Conference on Robotics, pp. 1817–1824 (2011)
Google Scholar
Schwarz, M., Schulz, H., S.B.: RGB-D object recognition and pose estimation based on pre-trained convolutional neural network features. In: IEEE International Conference on Robotics and Automation (ICRA), pp. 1329–1335 (2015)
Google Scholar
Socher, R., Huval, B., Bhat, B., Manning, C., Ng, A.: Convolutional-recursive deep learning for 3D object classification. Adv. Neural Inf. Process. Syst. (NIPS) 25, 656–664 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Cognitive Systems Group, University of Tuebingen, Sand 1, 72076, Tuebingen, Germany
Lorand Madai-Tahy, Sebastian Otte, Richard Hanten & Andreas Zell

Authors

Lorand Madai-Tahy
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Otte
View author publications
You can also search for this author in PubMed Google Scholar
Richard Hanten
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Zell
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lorand Madai-Tahy .

Editor information

Editors and Affiliations

University of Lausanne, Lausanne, Switzerland
Alessandro E.P. Villa
University of Lausanne, Lausanne, Switzerland
Paolo Masulli
Universitat Politécnica de Catalunya, Terrrassa, Spain
Antonio Javier Pons Rivero

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Madai-Tahy, L., Otte, S., Hanten, R., Zell, A. (2016). Revisiting Deep Convolutional Neural Networks for RGB-D Based Object Recognition. In: Villa, A., Masulli, P., Pons Rivero, A. (eds) Artificial Neural Networks and Machine Learning – ICANN 2016. ICANN 2016. Lecture Notes in Computer Science(), vol 9887. Springer, Cham. https://doi.org/10.1007/978-3-319-44781-0_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-44781-0_4
Published: 13 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44780-3
Online ISBN: 978-3-319-44781-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics