Task-related Item-Name Discovery Using Text and Image Data from the Internet

Thaipumi, Putti; Hasegawa, Osamu

doi:10.1007/978-3-319-78452-6_6

Task-related Item-Name Discovery Using Text and Image Data from the Internet

Putti Thaipumi²¹ &
Osamu Hasegawa²¹

Conference paper
First Online: 31 May 2018

877 Accesses

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 751))

Abstract

There is a huge number of data on the Internet that can be used for the development of machine learning in a robot or an AI agent. Utilizing this unorganized data, however, usually requires pre-collected database, which is time-consuming and expensive to make. This paper proposes a framework for collecting names of items required for performing a task, using text and image data available on the Internet without relying on any dictionary or pre-made database. We demonstrate a method to use text data acquired from Google Search to estimate term frequency-inverse document frequency (TF-IDF) value for task-word-relation verification, then identify words that are likely to be an item-name using image classification. We show the comparison results of measuring words’ item-name likelihood using various image classification settings. Finally, we have demonstrated that our framework can discover more than 45% of the desired item-names on three example tasks.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Bird, S.: Nltk: the natural language toolkit. In: Proceedings of the COLING/ACL on Interactive Presentation Sessions, pp. 69–72. Association for Computational Linguistics (2006)
Google Scholar
Bird, S., Klein, E., Loper, E.: Natural language processing with Python: analyzing text with the natural language toolkit, O’Reilly Media, Inc. (2009)
Google Scholar
Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets. In: BMVC (2014)
Google Scholar
Chen, J., Cui, Y., Ye, G., Liu, D., Chang, S.F.: Event-driven semantic concept discovery by exploiting weakly tagged internet images. In: Proceedings of International Conference on Multimedia Retrieval, ICMR ’14, pp. 1:1–1:8. ACM, New York, NY, USA (2014). http://doi.acm.org/10.1145/2578726.2578729
Chen, X., Gupta, A.: Webly supervised learning of convolutional networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1431–1439 (2015)
Google Scholar
Divvala, S.K., Farhadi, A., Guestrin, C.: Learning everything about anything: Webly-supervised visual concept learning. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (June 2014)
Google Scholar
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Article Google Scholar
Fergus, R., Fei-Fei, L., Perona, P., Zisserman, A.: Learning object categories from internet image searches. Proc. IEEE 98(8), 1453–1466 (2010)
Article Google Scholar
Girshick, R.: Fast R-CNN. In: The IEEE International Conference on Computer Vision (ICCV) (Dec 2015)
Google Scholar
Malisiewicz, T., Gupta, A., Efros, A.A.: Ensemble of exemplar-SVMS for object detection and beyond. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 89–96. IEEE (2011)
Google Scholar
Michel, J.B., Shen, Y.K., Aiden, A.P., Veres, A., Gray, M.K., Pickett, J.P., Hoiberg, D., Clancy, D., Norvig, P., Orwant, J., et al.: Quantitative analysis of culture using millions of digitized books. Science 331(6014), 176–182 (2011)
Article Google Scholar
Miller, G.A.: Wordnet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)
Article Google Scholar
Riboni, D., Murtas, M.: Web mining and computer vision: new partners for object-based activity recognition. In: 2017 IEEE 26th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE), pp. 158–163. IEEE (2017)
Google Scholar
Torresani, L., Szummer, M., Fitzgibbon, A.: Efficient object category recognition using classemes. In: Computer Vision-ECCV 2010, pp. 776–789 (2010)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computational Intelligence and Systems Science, Tokyo Institute of Technology, Tokyo, Japan
Putti Thaipumi & Osamu Hasegawa

Authors

Putti Thaipumi
View author publications
You can also search for this author in PubMed Google Scholar
Osamu Hasegawa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Putti Thaipumi .

Editor information

Editors and Affiliations

School of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea (Republic of)
Jong-Hwan Kim
Department of Civil and Environmental Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea (Republic of)
Hyun Myung
School of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea (Republic of)
Junmo Kim
Department of Mechanical Engineering, The University of Auckland, Auckland, New Zealand
Weiliang Xu
Department of Computer and Information Technology, Purdue University, West Lafayette, Indiana, USA
Eric T Matson
Department of Computer Science and Engineering, Dongguk University, Seoul, Korea (Republic of)
Jin-Woo Jung
Department of Aerospace Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea (Republic of)
Han-Lim Choi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Thaipumi, P., Hasegawa, O. (2019). Task-related Item-Name Discovery Using Text and Image Data from the Internet. In: Kim, JH., et al. Robot Intelligence Technology and Applications 5. RiTA 2017. Advances in Intelligent Systems and Computing, vol 751. Springer, Cham. https://doi.org/10.1007/978-3-319-78452-6_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-78452-6_6
Published: 31 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-78451-9
Online ISBN: 978-3-319-78452-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics