Graspable Object Classification with Multi-loss Hierarchical Representations

Wang, Zhichao; Li, Zhiqi; Wang, Bin; Liu, Hong

doi:10.1007/978-3-319-65298-6_42

Zhichao Wang¹⁷,
Zhiqi Li¹⁷,
Bin Wang¹⁷ &
…
Hong Liu¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10464))

Included in the following conference series:

International Conference on Intelligent Robotics and Applications

4870 Accesses

Abstract

To allow robots to accomplish manipulation work effectively, one of the critical functions they need is to precisely and robustly recognize the robotic graspable object and the category of the graspable objects, especially in data limited condition. In this paper, we propose a novel multi-loss hierarchical representations learning framework that is capable of recognizing the category of graspable objects in a coarse-to-fine way. Our model consists of two main components, an efficient hierarchical feature learning component that combines kernel features with the deep learning features and a multi-loss function that optimizes the multi-task learning mechanism in a coarse-to-fine way. We demonstrate the power of our proposed system to data of graspable and ungraspable objects. The results show that our system has superior performance than many existing algorithms both in terms of classification accuracy and computation efficiency. Moreover, our system achieves a quite high accuracy (about 82%) in unstructured real-world condition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Lenz, I., Lee, H., Saxena, A.: Deep learning for detecting robotic grasps. Int. J. Robot. Res. 34(4–5), 705–724 (2015)
Article Google Scholar
Redmon, J., Angelova, A.: Real-time grasp detection using convolutional neural networks 2015, pp. 1316–1322 (2015)
Google Scholar
Wang, Z., Li, Z., Wang, B., Liu, H.: Robot grasp detection using multimodal deep convolutional neural networks. Adv. Mech. Eng. 8(9) (2016). doi:10.1177/1687814016668077
Girshick, R.: Fast R-CNN. In: International Conference on Computer Vision (ICCV) (2015)
Google Scholar
Bo, L., Lai, K., Ren, X., Fox, D.: Object recognition with hierarchical kernel descriptors. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1729–1736. IEEE (2011)
Google Scholar
Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006)
Article MathSciNet MATH Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Karpathy, A., Fei-Fei, L.: Deep visual-semantic alignments for generating image descriptions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3128–3137 (2015)
Google Scholar
Saxena, A., Driemeyer, J., Ng, A.Y.: Robotic grasping of novel objects using vision. Int. J. Robot. Res. 27(2), 157–173 (2008)
Article Google Scholar
Levine, S., Pastor, P., Krizhevsky, A., Quillen, D.: Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection (2016)
Google Scholar
Bohg, J., Morales, A., Asfour, T., Kragic, D.: Data-driven grasp synthesis a survey. IEEE Trans. Rob. 30(2), 289–309 (2014)
Article Google Scholar
Pinto, L., Gupta, A.: Supersizing self-supervision: Learning to grasp from 50k tries and 700 robot hours (2015)
Google Scholar
Dai, J., He, K., Sun, J.: Instance-aware semantic segmentation via multi-task network cascades, pp. 3150–3158 (2016)
Google Scholar
Wang, K., Lin, L., Zuo, W., Gu, S., Zhang, L.: Dictionary pair classifier driven convolutional neural networks for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2138–2146 (2016)
Google Scholar
Bo, L., Ren, X., Fox, D.: Kernel descriptors for visual recognition. In: Advances in Neural Information Processing Systems, pp. 244–252 (2010)
Google Scholar
Schölkopf, B., Smola, A., Müller, K.-R.: Kernel principal component analysis. In: Gerstner, W., Germond, A., Hasler, M., Nicoud, J.-D. (eds.) ICANN 1997. LNCS, vol. 1327, pp. 583–588. Springer, Heidelberg (1997). doi:10.1007/BFb0020217
Google Scholar
Wang, Q.: Kernel principal component analysis and its applications in face recognition and active shape models (2012). arXiv preprint arXiv:1207.3538
Dauphin, Y., De Vries, H., Chung, J., Bengio, Y.: RMSprop and equilibrated adaptive learning rates for non-convex optimization. arxiv preprint (2015). arXiv preprint arXiv:1502.04390
Lai, K., Bo, L., Ren, X., Fox, D.: A large-scale hierarchical multi-view RGB-D object dataset. In: 2011 IEEE International Conference on Robotics and Automation (ICRA), pp. 1817–1824. IEEE (2011)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Harbin Institute of Technology, Harbin, 150001, China
Zhichao Wang, Zhiqi Li, Bin Wang & Hong Liu

Authors

Zhichao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhiqi Li
View author publications
You can also search for this author in PubMed Google Scholar
Bin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hong Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bin Wang .

Editor information

Editors and Affiliations

School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
YongAn Huang
School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
Hao Wu
Institute of Industrial Research, University of Portsmouth, Portsmouth, United Kingdom
Honghai Liu
School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
Zhouping Yin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, Z., Li, Z., Wang, B., Liu, H. (2017). Graspable Object Classification with Multi-loss Hierarchical Representations. In: Huang, Y., Wu, H., Liu, H., Yin, Z. (eds) Intelligent Robotics and Applications. ICIRA 2017. Lecture Notes in Computer Science(), vol 10464. Springer, Cham. https://doi.org/10.1007/978-3-319-65298-6_42

Download citation

DOI: https://doi.org/10.1007/978-3-319-65298-6_42
Published: 06 August 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65297-9
Online ISBN: 978-3-319-65298-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics