Enabling Imagination: Generative Adversarial Network-Based Object Finding in Robotic Tasks

Che, Huimin; Hu, Ben; Ding, Bo; Wang, Huaimin

doi:10.1007/978-3-319-70136-3_11

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 10639)

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

The ability to find objects in real-world situations is important for mobile robots. Existing work on robotic vision-based object finding follows the traditional training and classification paradigm, which means that a robot can only detect objects that carry fixed, pre-trained classification labels. It is very challenging for a robot to find an untrained object, even when a complex description of the object is given. In this paper, we propose a vision-based object detection approach for robotic finding, named Generative Search. It is inspired by the way humans detect objects: when an unfamiliar object must be found from a complex description, a person "imagines" the object and then looks for the object that most resembles the imagined profile. By adopting a Generative Adversarial Network (GAN), our approach enables the robot to generate the object virtually according to the given description. We then use pre-trained deep neural networks to match the generated image against the images in the robot's vision. At the implementation level, we adopt a cloud robotic architecture to improve the efficiency of the algorithm. Experiments on both open datasets and real robotic scenarios show a significant improvement in object-finding accuracy when a robot searches for an unfamiliar object with a complex description.
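The matching step described in the abstract can be illustrated with a minimal sketch. The abstract does not state which similarity measure or feature extractor is used, so the cosine similarity, the function names, and the three-dimensional feature vectors below are all assumptions made for illustration. In a full system the vectors would presumably come from a pre-trained network applied to the GAN-generated image and to regions cropped from the camera frame.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as matching nothing
    return dot / (norm_a * norm_b)


def rank_candidates(
    imagined: Sequence[float],
    candidates: Dict[str, Sequence[float]],
) -> List[Tuple[str, float]]:
    """Sort candidate regions by similarity to the imagined object, best first."""
    scored = [
        (name, cosine_similarity(imagined, features))
        for name, features in candidates.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical feature vectors (assumption): stand-ins for the output of a
# pre-trained network on the generated image and on three camera regions.
imagined_features = [0.9, 0.1, 0.4]
region_features = {
    "region_a": [0.1, 0.9, 0.2],
    "region_b": [0.8, 0.2, 0.5],
    "region_c": [0.0, 0.0, 1.0],
}

ranking = rank_candidates(imagined_features, region_features)
best_region, best_score = ranking[0]
print(best_region, round(best_score, 3))
```

The region whose features point in nearly the same direction as the imagined object's features ranks first. A real deployment would likely also need a score threshold, so that the robot can report that no region matches rather than always returning the top candidate.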

Notes

1. https://github.com/SelinaChe/GAN-based-Object-Finding

Acknowledgments

This work is partially supported by the National Natural Science Foundation of China (nos. 91118008 and 61202117), the special program for the applied basic research of the National University of Defense Technology (no. ZDYYJCYJ20140601), and the Jiangsu Future Networks Innovation Institute Prospective Research Project on Future Networks (no. BY2013095-2-08).

Author information

Authors and Affiliations

National Key Lab of Parallel and Distributed Processing, College of Computer, National University of Defense Technology, Changsha, China
Huimin Che, Ben Hu, Bo Ding & Huaimin Wang

Corresponding author

Correspondence to Huimin Che.

Editor information

Editors and Affiliations

Guangdong University of Technology, Guangzhou, China
Derong Liu
Guangdong University of Technology, Guangzhou, China
Shengli Xie
South China University of Technology, Guangzhou, China
Yuanqing Li
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Dongbin Zhao
King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia
El-Sayed M. El-Alfy

About this paper

Cite this paper

Che, H., Hu, B., Ding, B., Wang, H. (2017). Enabling Imagination: Generative Adversarial Network-Based Object Finding in Robotic Tasks. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, ES. (eds) Neural Information Processing. ICONIP 2017. Lecture Notes in Computer Science, vol 10639. Springer, Cham. https://doi.org/10.1007/978-3-319-70136-3_11

DOI: https://doi.org/10.1007/978-3-319-70136-3_11
Published: 26 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70135-6
Online ISBN: 978-3-319-70136-3
eBook Packages: Computer Science, Computer Science (R0)
