Skip to main content

Contextualizing Tag Ranking and Saliency Detection for Social Images

  • Conference paper
Advances in Multimedia Modeling

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7733))

Abstract

Tag ranking and saliency detection are two key tasks for image understanding, and have attracted much attention in the past decades. In this paper, we investigate how to iteratively and mutually boost tag ranking and saliency detection by taking the outputs from one task as the context of the other one. Our method first computes an initial saliency value based on fusing multiple feature maps, and then iteratively refines saliency map based on the contextual information from image tag ranking. As a result, an integrated framework for tag saliency ranking which combines both visual attention model and multi-instance learning to investigate the saliency ranking order information. We show that this mutual reinforcement of saliency detection and tag ranking improves the performance by using this combined approach. Experiments conducted on Corel and Flickr image datasets demonstrate the effectiveness of the proposed framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. TPAMI (1998)

    Google Scholar 

  2. Bruce, N., Tsotsos, J.: Saliency based on information maximization. In: NIPS (2006)

    Google Scholar 

  3. Harel, J., Koch, C., Perona, P.: Graph-based visual saliency. In: NIPS (2006)

    Google Scholar 

  4. Tsotsos, J., Culhane, S., Wai, W., Lai, Y., Davis, N.: Modelling visual attention via selective tuning. Artificial Intelligence (1995)

    Google Scholar 

  5. Judd, T., Ehinger, K., Durand, F., Torralba, A.: Learning to predict where humans look. In: ICCV (2009)

    Google Scholar 

  6. Meur, O., Chevet, J.: Relevance of a feed-forward model of visual attention for goal-oriented and free-viewing tasks. TIP (2010)

    Google Scholar 

  7. Hou, X., Zhang, L.: Dynamic visual attention: searching for coding length increments. In: NIPS (2008)

    Google Scholar 

  8. Lang, C., Liu, G., Yu, J., Yan, S.: Saliency detection by multi-task sparsity pursuit. TIP (2011)

    Google Scholar 

  9. Wang, M., Ni, B., Hua, X., Chua, T.: Assistive Tagging: A Survey of Multimedia Tagging with Human-Computer Joint Exploration. ACM Computing Survey (2012)

    Article  Google Scholar 

  10. Liu, D., Hua, X.S., Yang, L.J., Wang, M., Zhang, H.J.: Tag Ranking In: WWW (2009)

    Google Scholar 

  11. Li, X.R., Snoek, C.G.M., Worring, M.: Learning Social Tag Relevance by Neighbor Voting. IEEE Trans. on Multimedia (2009)

    Google Scholar 

  12. Feng, S., Lang, C., Xu, D.: Beyond tag relevance: integrating visual attention model and multi-instance learning for tag saliency ranking In: ACM CIVR (2010)

    Google Scholar 

  13. Wang, C.H., Yan, S.C.: Multi-Label Sparse Coding for Automatic Image Annotation In: IEEE CVPR (2009)

    Google Scholar 

  14. Gao, S., Wang, Z., Chia, L.: Automatic Image Tagging via Category Label and Web Data In: ACM Multimedia (2010)

    Google Scholar 

  15. Tang, J., Hong, R., Yan, S., Chua, T.: Image Annotation by kNN-Sparse Graph-based Label Propagation over Noisily-Tagged Web Images. ACM Trans. on Intelligent Systems and Technology (2011)

    Article  Google Scholar 

  16. Tang, J., Yan, S., Hong, R., Qi, G., Seng Chua, T.: Inferring semantic concepts from community-contributed images and noisy tags. In: ACM Multimedia (2009)

    Google Scholar 

  17. Chua, T., Tang, J., Hong, R.: NUS-WIDE: A Real-World Web Image Database from National University of Singapore In: ACM CIVR (2009)

    Google Scholar 

  18. Rahmani, R., Goldman, S.: MISSL: multiple-instance semi-supervised learning In: ICML (2006)

    Google Scholar 

  19. Wang, M., Yang, K., Hua, X., Zhang, H.: Towards a Relevant and Diverse Search of Social Images. IEEE Trans. on Multimedia (2010)

    Google Scholar 

  20. Wang, M., Hong, R., Li, G., Zha, Z.: Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification. IEEE Transactions on Multimedia (2012)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, W., Lang, C., Feng, S. (2013). Contextualizing Tag Ranking and Saliency Detection for Social Images. In: Li, S., et al. Advances in Multimedia Modeling. Lecture Notes in Computer Science, vol 7733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35728-2_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35728-2_41

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35727-5

  • Online ISBN: 978-3-642-35728-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics