Abstract
There is a huge number of data on the Internet that can be used for the development of machine learning in a robot or an AI agent. Utilizing this unorganized data, however, usually requires pre-collected database, which is time-consuming and expensive to make. This paper proposes a framework for collecting names of items required for performing a task, using text and image data available on the Internet without relying on any dictionary or pre-made database. We demonstrate a method to use text data acquired from Google Search to estimate term frequency-inverse document frequency (TF-IDF) value for task-word-relation verification, then identify words that are likely to be an item-name using image classification. We show the comparison results of measuring words’ item-name likelihood using various image classification settings. Finally, we have demonstrated that our framework can discover more than 45% of the desired item-names on three example tasks.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Bird, S.: Nltk: the natural language toolkit. In: Proceedings of the COLING/ACL on Interactive Presentation Sessions, pp. 69–72. Association for Computational Linguistics (2006)
Bird, S., Klein, E., Loper, E.: Natural language processing with Python: analyzing text with the natural language toolkit, O’Reilly Media, Inc. (2009)
Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets. In: BMVC (2014)
Chen, J., Cui, Y., Ye, G., Liu, D., Chang, S.F.: Event-driven semantic concept discovery by exploiting weakly tagged internet images. In: Proceedings of International Conference on Multimedia Retrieval, ICMR ’14, pp. 1:1–1:8. ACM, New York, NY, USA (2014). http://doi.acm.org/10.1145/2578726.2578729
Chen, X., Gupta, A.: Webly supervised learning of convolutional networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1431–1439 (2015)
Divvala, S.K., Farhadi, A., Guestrin, C.: Learning everything about anything: Webly-supervised visual concept learning. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (June 2014)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Fergus, R., Fei-Fei, L., Perona, P., Zisserman, A.: Learning object categories from internet image searches. Proc. IEEE 98(8), 1453–1466 (2010)
Girshick, R.: Fast R-CNN. In: The IEEE International Conference on Computer Vision (ICCV) (Dec 2015)
Malisiewicz, T., Gupta, A., Efros, A.A.: Ensemble of exemplar-SVMS for object detection and beyond. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 89–96. IEEE (2011)
Michel, J.B., Shen, Y.K., Aiden, A.P., Veres, A., Gray, M.K., Pickett, J.P., Hoiberg, D., Clancy, D., Norvig, P., Orwant, J., et al.: Quantitative analysis of culture using millions of digitized books. Science 331(6014), 176–182 (2011)
Miller, G.A.: Wordnet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)
Riboni, D., Murtas, M.: Web mining and computer vision: new partners for object-based activity recognition. In: 2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE), pp. 158–163. IEEE (2017)
Torresani, L., Szummer, M., Fitzgibbon, A.: Efficient object category recognition using classemes. In: Computer Vision-ECCV 2010, pp. 776–789 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Thaipumi, P., Hasegawa, O. (2019). Task-related Item-Name Discovery Using Text and Image Data from the Internet. In: Kim, JH., et al. Robot Intelligence Technology and Applications 5. RiTA 2017. Advances in Intelligent Systems and Computing, vol 751. Springer, Cham. https://doi.org/10.1007/978-3-319-78452-6_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-78452-6_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-78451-9
Online ISBN: 978-3-319-78452-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)