TagBag: Annotating a Foreign Language Lexical Resource with Pictures

  • Conference paper
Analysis of Images, Social Networks and Texts (AIST 2015)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 542)

Abstract

Forms of art such as photography and drawing can serve as a uniform language that represents things we can either see or imagine. It is therefore reasonable to use pictures to connect nouns of different natural languages by their meanings. This paper presents a study of mapping images of nouns from an annotated collection to the word senses of a foreign-language lexical resource by means of a bilingual dictionary. In the study, the English-Russian dictionary by V.K. Mueller is used to enhance the synsets of Yet Another RussNet with Flickr photos.
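The idea of linking tagged pictures to word senses through a bilingual dictionary can be illustrated with a minimal sketch. It is not the procedure used in the paper: the function name `map_photos_to_synsets`, the data layout, and the toy entries are all assumptions made for illustration. Each photo tagged with an English noun is attached to every synset that contains one of the noun's dictionary translations.

```python
from collections import defaultdict


def map_photos_to_synsets(photos_by_tag, dictionary, synsets):
    """Attach photos tagged with an English noun to every synset that
    contains one of that noun's translations (hypothetical scheme)."""
    # Index: target-language word -> ids of the synsets that contain it.
    synsets_by_word = defaultdict(set)
    for synset_id, words in synsets.items():
        for word in words:
            synsets_by_word[word].add(synset_id)

    # Follow tag -> translation -> synset and collect the photos.
    photos_by_synset = defaultdict(set)
    for tag, photos in photos_by_tag.items():
        for translation in dictionary.get(tag, ()):
            for synset_id in synsets_by_word.get(translation, ()):
                photos_by_synset[synset_id].update(photos)
    return {sid: sorted(found) for sid, found in photos_by_synset.items()}


# Toy data, invented for this example only.
photos_by_tag = {"cat": ["photo-1", "photo-2"], "key": ["photo-3"]}
dictionary = {"cat": ["кошка", "кот"], "key": ["ключ"]}
synsets = {
    "s1": {"кошка", "кот"},
    "s2": {"ключ", "источник"},  # "ключ" in the sense of a water spring
    "s3": {"ключ", "отмычка"},   # "ключ" in the sense of a door key
}

result = map_photos_to_synsets(photos_by_tag, dictionary, synsets)
```

In this toy run the photo tagged "key" ends up in both `s2` and `s3`, because the translation "ключ" is ambiguous. A naive dictionary lookup of this kind therefore cannot pick the right sense on its own, and the resulting links need to be checked.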

Notes

  1. https://www.flickr.com/services/api/flickr.photos.search.html

  2. http://nlpub.ru/YARN/Format

  3. http://www.talkenglish.com/Vocabulary/Top-1500-Nouns.aspx

  4. http://mueller-dic.chat.ru/

  5. http://ustalov.imm.uran.ru/pub/mueller.tar.gz

  6. https://www.flickr.com/photos/33818912@N00/16224853830

  7. https://www.flickr.com/photos/16391511@N00/16380713555

  8. https://www.flickr.com/photos/39585662@N00/16204920679

Acknowledgements

This work is supported by the Russian Foundation for the Humanities, project no. 13-04-12020 “New Open Electronic Thesaurus for Russian”, and by the Program of the Government of the Russian Federation 02.A03.21.0006 on 27.08.2013. The URAN supercomputer located at the N.N. Krasovskii Institute of Mathematics and Mechanics of the Ural Branch of the Russian Academy of Sciences was used to obtain the image collection. The author is grateful to the annotators who participated in the evaluation and to the anonymous referees, who offered very useful comments on the present paper.

Author information

Corresponding author

Correspondence to Dmitry Ustalov.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Ustalov, D. (2015). TagBag: Annotating a Foreign Language Lexical Resource with Pictures. In: Khachay, M., Konstantinova, N., Panchenko, A., Ignatov, D., Labunets, V. (eds) Analysis of Images, Social Networks and Texts. AIST 2015. Communications in Computer and Information Science, vol 542. Springer, Cham. https://doi.org/10.1007/978-3-319-26123-2_35

  • DOI: https://doi.org/10.1007/978-3-319-26123-2_35

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26122-5

  • Online ISBN: 978-3-319-26123-2

  • eBook Packages: Computer Science, Computer Science (R0)
