Skip to main content

Automatic Image Semantic Annotation Based on Image-Keyword Document Model

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3568))

Abstract

This paper presents a novel method of automatic image semantic annotation. Our approach is based on the Image-Keyword Document Model (IKDM) with image features discretization. According to IKDM, the image keyword annotation is conducted using image similarity measurement based on language model from text information retrieval domain. Through the experiments on a testing set of 5000 annotated images, our approach demonstrates great improvement of annotation performance compared with the known discretization-based image annotation model such as CMRM. Our approach also performs better in annotation time compared with the continuous model such as CRM.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Assfalg, J., Bertini, M., Colombo, C., Del Bimbo, A.: Semantic Annotation of Sports Videos. IEEE Multimedia (April-June 2002)

    Google Scholar 

  2. Barnard, K., Duygulu, P., Forsyth, D.: Clustering Art. In: Proceedings of IEEE ICPR (2001)

    Google Scholar 

  3. Blei, D., Jordan, M.I.: Modeling annotated data. In: Proc. of the 26th Intl. ACM SIGIR Conf., pp. 127–134 (2003)

    Google Scholar 

  4. Berman, A., Shapiro, L.G.: Efficient image retrieval with multiple distance measures. In: Storage and Retrieval for Image and Video Databases(SPIE), pp. 12–21 (1997)

    Google Scholar 

  5. Cusano, C., Ciocca, G., Schettini, R.: Image Annotation Using Svm. In: Proceedings of Internet imaging IV, vol. SPIE 5304 (2004)

    Google Scholar 

  6. Duygulu, P., Barnard, K., de Freitas, N., Forsyth, D.: Object recognition as machine translation:learning a lexicon for a fixed image vocabulary. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 97–112. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  7. Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proc. 13th IJCAI, pp. 1022C–1027C (1993)

    Google Scholar 

  8. Fountain, S., Tan, T.: Content Based Annotation and Retrieval. In: RAIDER IRSG (1998)

    Google Scholar 

  9. Gupta, A., Weymouth, T.E., Jain, R.: Semantic queries with pictures: the VIMSYS model. In: VLDB, pp. 69–79 (1991)

    Google Scholar 

  10. Jaser, E., Kittler, J., Christmas, W.J.: Hierarchical Decision Making Scheme for Sports Video Categorisation with Temporal Post-Processing. In: CVPR, vol. II, pp. 908–913 (2004)

    Google Scholar 

  11. Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proc. of 26th ACM SIGIR, pp. 119–126 (2003)

    Google Scholar 

  12. Jin, R., Chai, J., Si, L.: Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning. In: Proc. of ACM Multimedia (2004)

    Google Scholar 

  13. Lavrenko, V., Manmatha, R., Jeon, J.: A Model for Learning the Semantics of Pictures. In: Proceedings of Advances in Neural Information Processing (2003)

    Google Scholar 

  14. Zhang, L., Chen, L., Li, M., Zhang, H.: Automated annotation of human faces in family albums. In: Proc. of ACM Multimedia, pp. 355–358 (2003)

    Google Scholar 

  15. Mori, Y., Takahashi, H., Oka, R.: Image-to-word transformation based on dividing and vector quantizing images with words. In: Proc. of MISRM (1999)

    Google Scholar 

  16. Muller, H., Muller, W., Marchand-Maillet, S., Pun, T., Squire, D.: Strategies for Positive and Negative Relevance Feedback in Image Retrieval. In: ICPR, pp. 5043–5042 (2000)

    Google Scholar 

  17. Lew, M., Sebe, N., Eakins, J.: Challenges of image and video retrieval. In: Lew, M., Sebe, N., Eakins, J.P. (eds.) CIVR 2002. LNCS, vol. 2383, pp. 1–6. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  18. Monay, F., Gatica-Perez, D.: On Image Auto- Annotation with Latent Space Models. In: Proceedings of ACM Multimedia Conf. (2003)

    Google Scholar 

  19. Naphade, M.R., Kozintsev, I.V., Huang, T.S.: A Factor Graph Framework for Semantic Video Inexing. IEEE Trans. on Circuits and Systems for Video Technology 12(1) (2002)

    Google Scholar 

  20. Wang, W., Zhang, A.: Evaluation of low-level features by decisive feature patterns. In: Proc. of IEEE ICME (2004)

    Google Scholar 

  21. Rui, Y., Huang, T.S.: A novel relevance feedback technique in image retrieval. In: Proc. of the 7th ACM Int.Conf. on Multimedia, pp. 67–70 (1999)

    Google Scholar 

  22. Smith, J.R., Chang, S.-F.: VisualSEEk: A Fully Automated Content-Based Image Query System. In: Proc. of ACM Multimedia, pp. 87–98 (1996)

    Google Scholar 

  23. Tao, J.L., Hung, Y.P.: A bayesian method for content-based image retrieval by use of relevance feedback. In: Chang, S.-K., Chen, Z., Lee, S.-Y. (eds.) VISUAL 2002. LNCS, vol. 2314, pp. 76–87. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhou, X., Chen, L., Ye, J., Zhang, Q., Shi, B. (2005). Automatic Image Semantic Annotation Based on Image-Keyword Document Model. In: Leow, WK., Lew, M.S., Chua, TS., Ma, WY., Chaisorn, L., Bakker, E.M. (eds) Image and Video Retrieval. CIVR 2005. Lecture Notes in Computer Science, vol 3568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11526346_22

Download citation

  • DOI: https://doi.org/10.1007/11526346_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27858-0

  • Online ISBN: 978-3-540-31678-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics