Automatic Image Semantic Annotation Based on Image-Keyword Document Model

Zhou, Xiangdong; Chen, Lian; Ye, Jianye; Zhang, Qi; Shi, Baile

doi:10.1007/11526346_22

Automatic Image Semantic Annotation Based on Image-Keyword Document Model

Xiangdong Zhou²¹,
Lian Chen²¹,
Jianye Ye²¹,
Qi Zhang²² &
…
Baile Shi²¹

Conference paper

1148 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3568))

Abstract

This paper presents a novel method of automatic image semantic annotation. Our approach is based on the Image-Keyword Document Model (IKDM) with image features discretization. According to IKDM, the image keyword annotation is conducted using image similarity measurement based on language model from text information retrieval domain. Through the experiments on a testing set of 5000 annotated images, our approach demonstrates great improvement of annotation performance compared with the known discretization-based image annotation model such as CMRM. Our approach also performs better in annotation time compared with the continuous model such as CRM.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Assfalg, J., Bertini, M., Colombo, C., Del Bimbo, A.: Semantic Annotation of Sports Videos. IEEE Multimedia (April-June 2002)
Google Scholar
Barnard, K., Duygulu, P., Forsyth, D.: Clustering Art. In: Proceedings of IEEE ICPR (2001)
Google Scholar
Blei, D., Jordan, M.I.: Modeling annotated data. In: Proc. of the 26th Intl. ACM SIGIR Conf., pp. 127–134 (2003)
Google Scholar
Berman, A., Shapiro, L.G.: Efficient image retrieval with multiple distance measures. In: Storage and Retrieval for Image and Video Databases(SPIE), pp. 12–21 (1997)
Google Scholar
Cusano, C., Ciocca, G., Schettini, R.: Image Annotation Using Svm. In: Proceedings of Internet imaging IV, vol. SPIE 5304 (2004)
Google Scholar
Duygulu, P., Barnard, K., de Freitas, N., Forsyth, D.: Object recognition as machine translation:learning a lexicon for a fixed image vocabulary. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 97–112. Springer, Heidelberg (2002)
Chapter Google Scholar
Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proc. 13th IJCAI, pp. 1022C–1027C (1993)
Google Scholar
Fountain, S., Tan, T.: Content Based Annotation and Retrieval. In: RAIDER IRSG (1998)
Google Scholar
Gupta, A., Weymouth, T.E., Jain, R.: Semantic queries with pictures: the VIMSYS model. In: VLDB, pp. 69–79 (1991)
Google Scholar
Jaser, E., Kittler, J., Christmas, W.J.: Hierarchical Decision Making Scheme for Sports Video Categorisation with Temporal Post-Processing. In: CVPR, vol. II, pp. 908–913 (2004)
Google Scholar
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proc. of 26th ACM SIGIR, pp. 119–126 (2003)
Google Scholar
Jin, R., Chai, J., Si, L.: Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning. In: Proc. of ACM Multimedia (2004)
Google Scholar
Lavrenko, V., Manmatha, R., Jeon, J.: A Model for Learning the Semantics of Pictures. In: Proceedings of Advances in Neural Information Processing (2003)
Google Scholar
Zhang, L., Chen, L., Li, M., Zhang, H.: Automated annotation of human faces in family albums. In: Proc. of ACM Multimedia, pp. 355–358 (2003)
Google Scholar
Mori, Y., Takahashi, H., Oka, R.: Image-to-word transformation based on dividing and vector quantizing images with words. In: Proc. of MISRM (1999)
Google Scholar
Muller, H., Muller, W., Marchand-Maillet, S., Pun, T., Squire, D.: Strategies for Positive and Negative Relevance Feedback in Image Retrieval. In: ICPR, pp. 5043–5042 (2000)
Google Scholar
Lew, M., Sebe, N., Eakins, J.: Challenges of image and video retrieval. In: Lew, M., Sebe, N., Eakins, J.P. (eds.) CIVR 2002. LNCS, vol. 2383, pp. 1–6. Springer, Heidelberg (2002)
Chapter Google Scholar
Monay, F., Gatica-Perez, D.: On Image Auto- Annotation with Latent Space Models. In: Proceedings of ACM Multimedia Conf. (2003)
Google Scholar
Naphade, M.R., Kozintsev, I.V., Huang, T.S.: A Factor Graph Framework for Semantic Video Inexing. IEEE Trans. on Circuits and Systems for Video Technology 12(1) (2002)
Google Scholar
Wang, W., Zhang, A.: Evaluation of low-level features by decisive feature patterns. In: Proc. of IEEE ICME (2004)
Google Scholar
Rui, Y., Huang, T.S.: A novel relevance feedback technique in image retrieval. In: Proc. of the 7th ACM Int.Conf. on Multimedia, pp. 67–70 (1999)
Google Scholar
Smith, J.R., Chang, S.-F.: VisualSEEk: A Fully Automated Content-Based Image Query System. In: Proc. of ACM Multimedia, pp. 87–98 (1996)
Google Scholar
Tao, J.L., Hung, Y.P.: A bayesian method for content-based image retrieval by use of relevance feedback. In: Chang, S.-K., Chen, Z., Lee, S.-Y. (eds.) VISUAL 2002. LNCS, vol. 2314, pp. 76–87. Springer, Heidelberg (2002)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computing and Information Technology, Fudan University Shanghai, 200433, China
Xiangdong Zhou, Lian Chen, Jianye Ye & Baile Shi
Department of Computer Science, University of North Carolina at Chapel Hill,
Qi Zhang

Authors

Xiangdong Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Lian Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jianye Ye
View author publications
You can also search for this author in PubMed Google Scholar
Qi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Baile Shi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computer Science, National University of Singapore, Computing 1, 117590, Singapore
Wee-Kheng Leow
LIACS Media Lab, Leiden University,
Michael S. Lew & Erwin M. Bakker &
National University of Singapore, 3 Science Dr, 117543, Singapore
Tat-Seng Chua
Microsoft Research Asia, 4F, Sigma Center, No.49, Zhichun Road, 100080, Beijing, P.R.China
Wei-Ying Ma
School of Computing, National University of Singapore, 3 Science Drive 2, 117543, Singapore
Lekha Chaisorn

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhou, X., Chen, L., Ye, J., Zhang, Q., Shi, B. (2005). Automatic Image Semantic Annotation Based on Image-Keyword Document Model. In: Leow, WK., Lew, M.S., Chua, TS., Ma, WY., Chaisorn, L., Bakker, E.M. (eds) Image and Video Retrieval. CIVR 2005. Lecture Notes in Computer Science, vol 3568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11526346_22

Download citation

DOI: https://doi.org/10.1007/11526346_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27858-0
Online ISBN: 978-3-540-31678-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics