Multimedia Tools and Applications

, Volume 62, Issue 2, pp 451–478 | Cite as

i-TagRanker: an efficient tag ranking system for image sharing and retrieval using the semantic relationships between tags

  • Jin-Woo Jeong
  • Hyun-Ki Hong
  • Dong-Ho Lee


Folksonomy, considered a core component for Web 2.0 user-participation architecture, is a classification system made by user’s tags on the web resources. Recently, various approaches for image retrieval exploiting folksonomy have been proposed to improve the result of image search. However, the characteristics of the tags such as semantic ambiguity and non-controlledness limit the effectiveness of tags on image retrieval. Especially, tags associated with images in a random order do not provide any information about the relevance between a tag and an image. In this paper, we propose a novel image tag ranking system called i-TagRanker which exploits the semantic relationships between tags for re-ordering the tags according to the relevance with an image. The proposed system consists of two phases: 1) tag propagation phase, 2) tag ranking phase. In tag propagation phase, we first collect the most relevant tags from similar images, and then propagate them to an untagged image. In tag ranking phase, tags are ranked according to their semantic relevance to the image. From the experimental results on a Flickr photo collection about over 30,000 images, we show the effectiveness of the proposed system.


Tag ranking Tag-based image retrieval Semantic relationship Folksonomy WordNet 



This research was supported by the MKE(The Ministry of Knowledge Economy), Korea and Microsoft Research, under IT/SW Creative research program supervised by the NIPA(National IT Industry Promotion Agency) (NIPA-2010-C1810-1002-0012)


  1. 1.
    Baziz M, Boughanem M, Aussenac-Gilles N (2005) Conceptual indexing based on document content representation. In: Crestani F, Ruthven I (eds) Context: nature, impact, and role, vol 3507. Lecture notes in computer science. Springer Berlin, Heidelberg, pp 2021–2043. doi: 10.1007/11495222_14 Google Scholar
  2. 2.
    Benevenuto F, Duarte F, Rodrigues T, Almeida VAF, Almeida JM, Ross KW (2008) Understanding video interactions in youtube. Paper presented at the Proceeding of the 16th ACM international conference on Multimedia, Vancouver, British Columbia, CanadaGoogle Scholar
  3. 3.
    Budanitsky A, Hirst G (2001) Semantic distance in WordNet: an experimental, application-oriented evaluation of five measures. In: Workshop on Wordnet and Other Lexical Resources, Second Meeting of the North American Chapter of the Association for Computational LinguisticsGoogle Scholar
  4. 4.
    Chatzichristofis SA, Boutalis YS (2008) FCTH: Fuzzy Color and Texture Histogram - A Low Level Feature for Accurate Image Retrieval. In: Image Analysis for Multimedia Interactive Services, 2008. WIAMIS '08. Ninth International Workshop on. pp 191–196Google Scholar
  5. 5.
    Chatzichristofis SA, Zagoris K, Boutalis YS, Papamarkos N (2010) Accurate image retrieval based on compact composite descriptors and relevance feedback information. IJPRAI 24(2):207–244. doi: 10.1142/S0218001410007890 Google Scholar
  6. 6.
    Chi Z, Yan H, Pham T (1996) Fuzzy algorithms: with applications to image processing and pattern recognition, vol 10. Advances in Fuzzy Systems-Applications and Theory. World SciencificGoogle Scholar
  7. 7.
    Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. pp 886–893 vol. 881Google Scholar
  8. 8.
    Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):1–60. doi: 10.1145/1348246.1348248 CrossRefGoogle Scholar
  9. 9.
    Fellbaum C (1998) Wordnet: an electronic lexical database. MIT Press, Cambridge, MAMATHGoogle Scholar
  10. 10.
  11. 11.
    Flickr-Pool (2010) available at
  12. 12.
    Frey BJ, Dueck D (2007) Clustering by passing messages between data points. Science 315:972–976MathSciNetMATHCrossRefGoogle Scholar
  13. 13.
    Hirst G, D. S-O (1998) Lexical chains as representations of context for the detection and correction of malapropisms. Fellbaum C. MA: MIT PressGoogle Scholar
  14. 14.
  15. 15.
    JWordNetSim (2010) available at
  16. 16.
    Kasutani E, Yamada A (2001) The MPEG-7 color layout descriptor: a compact image feature description for high-speed image/video segment retrieval. In: Image Processing, 2001. Proceedings. 2001 International Conference on. pp 674–677 vol. 671Google Scholar
  17. 17.
    Kekäläinen J, Järvelin K (2002) Cumulated gain-based evaluation of IR techniques. ACM Trans Inf Syst 20(4):422–446. doi: 10.1145/582415.582418 CrossRefGoogle Scholar
  18. 18.
    Leachok C, Chodorow M (1998) Combining local context and WordNet similarity for word sense identification In: Fellbaum C (ed) WordNet: An electronic lexical database. MIT PressGoogle Scholar
  19. 19.
    Lee D-H, Kim H-J (2001) A fast content-based indexing and retrieval technique by the shape information in large image database. J Syst Softw 56(2):165–182. doi: 10.1016/S0164-1212(00)00095-9 CrossRefGoogle Scholar
  20. 20.
    Li X, Snoek CGM, Worring M (2008) Learning tag relevance by neighbor voting for social image retrieval. Paper presented at the Proceeding of the 1st ACM international conference on Multimedia information retrieval, Vancouver, British Columbia, CanadaGoogle Scholar
  21. 21.
    Li X, Snoek CGM, Worring M (2010) Unsupervised multi-feature tag relevance learning for social image retrieval. Paper presented at the Proceedings of the ACM International Conference on Image and Video Retrieval, Xi'an, ChinaGoogle Scholar
  22. 22.
    Lienhart R, Romberg S, Hörster E (2009) Multilayer pLSA for multimodal image retrieval. Paper presented at the Proceeding of the ACM International Conference on Image and Video Retrieval, Santorini, Fira, GreeceGoogle Scholar
  23. 23.
    Lin D (1998) Automatic retrieval and clustering of similar words. Paper presented at the Proceedings of the 17th international conference on Computational linguistics - Volume 2, Montreal, Quebec, CanadaGoogle Scholar
  24. 24.
    Lindstaedt S, Morzinger R, Sorschag R, Pammer V, Thallinger G (2009) Automatic image annotation using visual content and folksonomies. Multimed Tool Appl 42(1):97–113. doi: 10.1007/s11042-008-0247-7 CrossRefGoogle Scholar
  25. 25.
    Liu D, Hua X-S, Wang M, Zhang H-J (2010) Retagging social images based on visual and semantic consistency. Paper presented at the Proceedings of the 19th international conference on World wide web, Raleigh, North Carolina, USAGoogle Scholar
  26. 26.
    Liu D, Hua X-S, Yang L, Wang M, Zhang H-J (2009) Tag ranking. Paper presented at the Proceedings of the 18th international conference on World wide web, Madrid, SpainGoogle Scholar
  27. 27.
    Mackay WE (1989) EVA: an experimental video annotator for symbolic analysis of video data. SIGCHI Bull 21(2):68–71. doi: 10.1145/70609.70617 CrossRefGoogle Scholar
  28. 28.
    Monay F, Gatica-Perez D (2004) PLSA-based image auto-annotation: constraining the latent space. Paper presented at the Proceedings of the 12th annual ACM international conference on Multimedia, New York, NY, USAGoogle Scholar
  29. 29.
    Ooi BC, Tan K-L, Chua TS, Hsu W (1998) Fast image retrieval using color-spatial information. VLDB J 7(2):115–128. doi: 10.1007/s007780050057 CrossRefGoogle Scholar
  30. 30.
    Oomoto E, Tanaka K (1993) OVID: design and implementation of a video-object database system. IEEE Trans Knowl Data Eng 5(4):629–643CrossRefGoogle Scholar
  31. 31.
    Park K-W, Jeong J-W, Lee D-H (2007) OLYBIA: ontology-based automatic image annotation system using semantic inference rules. In: Advances in databases: concepts, systems and applications, vol 4443. Lecture notes in computer science. Springer Berlin/Heidelberg, pp 485–496. doi: 10.1007/978-3-540-71703-4_42
  32. 32.
    Pedersen T, Patwardhan S, Michelizzi J (2004) WordNet::Similarity: measuring the relatedness of concepts. Paper presented at the Demonstration Papers at HLT-NAACL 2004, Boston, MassachusettsGoogle Scholar
  33. 33.
    Raykar VC, Duraiswami R (2006) Fast optimal bandwidth selection for kernel density estimation. Paper presented at the SIAM International Conference on Data MiningGoogle Scholar
  34. 34.
    Resnik P (1998) Semantic similarity in a taxonnomy: an information-based measure and its application to problems of ambiguity in natural language. J Artif Intell Res 11:95–130Google Scholar
  35. 35.
    Rijsbergen CJV, Robertson SE, Porter MF (1980) New models in probabilistic information retrieval vol 5587. British Library research & development reports British Library Lending DivisionGoogle Scholar
  36. 36.
    Sanchez D, Moreno A (2008) Learning non-taxonomic relationships from web documents for domain ontology construction. Data Knowl Eng 64(3):600–623CrossRefGoogle Scholar
  37. 37.
    Smith JR, Chang S-F (1996) VisualSEEk: a fully automated content-based image query system. Paper presented at the Proceedings of the fourth ACM international conference on Multimedia, Boston, Massachusetts, United StatesGoogle Scholar
  38. 38.
    Wikipedia (2010) available at
  39. 39.
    Wu Z, Palmer M (1994) Verbs semantics and lexical selection. Paper presented at the Proceedings of the 32nd annual meeting on Association for Computational Linguistics, Las Cruces, New MexicoGoogle Scholar
  40. 40.
    Xu H, Wang J, Hua X-S, Li S (2009) Tag refinement by regularized LDA. In: ACM Multimedia' 09. pp 573–576Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  1. 1.KDE Laboratory, Department of Computer Science & EngineeringHanyang UniversityAnsan-siRepublic of Korea

Personalised recommendations