Machine Learning Methods in Automatic Image Annotation

Kwaśnicka, Halina; Paradowski, Mariusz

doi:10.1007/978-3-642-05179-1_18

Halina Kwaśnicka⁵ &
Mariusz Paradowski⁵

Part of the book series: Studies in Computational Intelligence ((SCI,volume 263))

2194 Accesses
2 Citations

Abstract

Machine learning methods are successfully applied in many branches of Computer Science. One of these branches is image analysis, and being more specific – Automatic Image Annotation. Automatic Image Annotation was found an important research domain several years ago. It grew from such research domains as image recognition and cross-lingual machine translation. Increase of computational, data storage and data transfer abilities of todays’ computers has been one of key factors, making Automatic Image Annotation possible. Automatic Image Annotation methods, which have appeared during last several years, make a large use of many machine learning approaches. Clustering and classification methods are most frequently applied to annotate images. The chapter consists of three main parts. In the first, some general information concerning annotation methods is presented. In the second part, two original annotation methods are described. The last part presents experimental studies of the proposed methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: Proceedings of NIPS. MIT Press, Cambridge (2003)
Google Scholar
Lavrenko, V., Feng, S.L., Manmatha, R.: Statistical models for automatic video annotation and retrieval. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), vol. 3, pp. 1044–1047 (2004)
Google Scholar
Jeon, J., Manmatha, R.: Automatic image annotation of news images with large vocabularies and low quality training data. MM 368, University of Massachusetts (2004)
Google Scholar
Feng, S.L., Manmatha, R., Lavrenko, V.: Multiple bernoulli relevance models for image and video annotation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), vol. 2, pp. 1002–1009 (2004)
Google Scholar
Laaksonen, J., Koskela, M., Oja, E.: Picsom-self-organizing image retrieval with mpeg-7 content descriptors. IEEE Transactions on Neural Networks 13(4), 841–853 (2002)
Article Google Scholar
Viitaniemi, V., Laaksonen, J.: Keyword-detection approach to automatic image annotation. In: Proceedings of 2nd European Workshop on the Integration of Knowledge, Semantic and Digital Media Technologies (EWIMT 2005), pp. 15–22 (2005)
Google Scholar
Cusano, C., Ciocca, G., Schettini, R.: Image annotation using SVM. In: Proceedings of SPIE, Internet Imaging V, vol. 5304, pp. 330–338 (2003)
Google Scholar
Chang, E., Goh, K., Sychay, G., Wu, G.: Cbsa: content-based soft annotation for multimodal image retrieval using bayes point machines. IEEE Transactions on Circuits and Systems for Video Technology 13(1), 26–38 (2003)
Article Google Scholar
Tang, J., Lewis, P.H.: A study of quality issues for image auto-annotation with the corel data-set. IEEE Transactions on Circuits and Systems for Video Technology 17(3), 384–389 (2007)
Article Google Scholar
Kwaænicka, H., Paradowski, M.: Multiple class machine learning approach for image auto-annotation problem. In: Proceedings of The Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), vol. 2, pp. 347–352 (2006)
Google Scholar
Goh, K.-S., Chang, E.Y., Li, B.: Transactions on Knowledge and Data Engineering 17(10), 1333–1346 (2005)
Google Scholar
Yasuhide, M., Hironobu, T., Ryuichi, O.: Image-to-word transformation based on dividing and vector quantizing images with words. In: Proceedings of the International Workshop on Multimedia Intelligent Storage and Retrieval Management (1999)
Google Scholar
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 119–126 (2003)
Google Scholar
Monay, F., Gatica-Perez, D.: On image auto-annotation with latent space models. In: Proceedings of the eleventh ACM international conference on Multimedia, pp. 275–278 (2003)
Google Scholar
Pan, J.-Y., Yang, H.-J., Duygulu, P., Faloutsos, C.: Automatic image captioning. In: Proceedings of the 2004 IEEE International Conference on Multimedia and Expo. (ICME 2004), vol. 3, pp. 1987–1990 (2004)
Google Scholar
Liu, W., Tang, X.: Learning an image-word embedding for image auto-annotation on the nonlinear latent space. In: Proceedings of the 13th annual ACM international conference on Multimedia, pp. 451–454 (2005)
Google Scholar
Glotin, H., Tollari, S.: Fast image auto-annotation with visual vector approximation clusters. In: Proceedings of Fourth International Workshop on Content-Based Multimedia Indexing, CBMI 2005 (2005)
Google Scholar
Kwaænicka, H., Paradowski, M.: Fast image auto-annotation with discretized feature distance measures. Machine Graphics and Vision 15(2), 123–140 (2006)
Google Scholar
Hollink, L., Nguyen, G., Schreiber, G., Wielemaker, J., Wielinga, B., Worring, M.: Adding spatial semantics to image annotations. In: Proceedings of the 4th International Workshop on Knowledge Markup and Semantic Annotation at ISWC 2004 (2004)
Google Scholar
Bashir, A., Khan, L.: A framework for image annotation using semantic web. In: Proceedings of ACM SIGKDD First International Workshop on Mining for and from the Semantic Web (MSW 2004) (2004)
Google Scholar
Srikanth, M., Varner, J., Bowden, M., Moldovan, D.: Exploiting ontologies for automatic image annotation. In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 552–558 (2005)
Google Scholar
Hollink, L., Little, S., Hunter, J.: Evaluating the application of semantic inferencing rules to image annotation. In: Proceedings of the 3rd international conference on Knowledge capture, pp. 91–98 (2005)
Google Scholar
Hare, J.S., Sinclair, P.A.S., Lewis, P.H., Martinez, K., Enser, P.G.B., Sandom, C.J.: Bridging the semantic gap in multimedia information retrieval top-down and bottom-up approaches. In: Proceedings of Mastering the Gap: From Information Extraction to Semantic Representation / 3rd European Semantic Web Conference (2006)
Google Scholar
Duygulu, P., Barnard, K., de Freitas, J.F.G., Forsyth, D.: Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 97–112. Springer, Heidelberg (2002)
Chapter Google Scholar
Rangayyan, R.M.: Biomedical Image Analysis. Biomedical Engineering Series. CRC Press, Boca Raton (2004)
Google Scholar
Kurzyñski, M.: Multistage diagnosis of myocardial infraction using a fuzzy relation. In: Proceesings of the 7th International Conference on Artificial Intelligence and Soft Computing, pp. 1014–1019 (2004)
Google Scholar
Abe, S.: Support Vector Machines for Pattern Classification. Springer-Verlag London Limited, Heidelberg (2005)
Google Scholar
Shah, B., Benton, R., Wu, Z., Raghavan, V.: Automatic and Semi-Automatic Techniques for Image Annotation, pp. 112–134 (2007)
Google Scholar
Barnard, K., Duygulu, P., de Freitas, N., Forsyth, D.: Object recognition as machine translation - part 2: Exploiting image database clustering models (2002)
Google Scholar
Barnard, K., Duygulu, P., Forsyth, D., de Freitas, N., Blei, D.M., Jordan, M.I.: Matching words and pictures. Journal of Machine Learning Research 3, 1107–1135 (2003)
Article MATH Google Scholar
Carneiro, G., Vasconcelos, N.: Formulating semantic image annotation as a supervised learning problem. In: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 2, pp. 163–168 (2005)
Google Scholar
Chan, A.B., Moreno, P.J., Vasconcelos, N.: Using statistics to search and annotate pictures: an evaluation of semantic image annotation and retrieval on large databases. In: Joint Statistical Meetings (2006)
Google Scholar
Carneiro, G., Chan, A.B., Moreno, P.J., Vasconcelos, N.: Supervised learning of semantic classes for image annotation and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(3), 394–410 (2007)
Article Google Scholar
Buturovic, A.: Mpeg-7 color structure descriptor for visual information retrieval project vizir. In: Proceedings of the International Conference on Image Processing, vol. 1, pp. 670–673 (2001)
Google Scholar
Tsai, C.-F., Hung, C.: Automatically annotating images with keywords a review of image annotation systems. Recent Patents on Computer Science (1), 55 (2008)
Google Scholar
Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8), 888–905 (2000)
Article Google Scholar
Cheng, P.-J., Chien, L.-F.: Effective image annotation for search using multi-level semantics. International Journal on Digital Libraries 4(4), 230 (2004)
Article Google Scholar
Hare, J.S., Lewis, P.H.: Saliency-based models of image content and their application to auto-annotation by semantic propagation. In: Multimedia and the Semantic Web / European Semantic Web Conference (2005)
Google Scholar
Vailaya, A., Jain, A., Zhang, H.J.: On image classification: city vs. landscape. Pattern Recognition 31(12), 1921–1935 (1998)
Article Google Scholar
Westerveld, T., de Vries, A.P., van Ballegooij, A., de Jong, F., Hiemstra, D.: A probabilistic multimedia retrieval model and its evaluation. EURASIP Journal on Applied Signal Processing 2, 186–198 (2003)
Google Scholar
Xu, F., Zhang, Y.-J.: A Novel Framework for Image Categorization and Automatic Annotation, pp. 90–111 (2007)
Google Scholar
Kang, F., Jin, R., Chai, J.Y.: Regularizing translation models for better automatic image annotation. In: Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, pp. 350–359 (2004)
Google Scholar
Duygulu, P., Hauptmann, A.: What’s news, what’s not? associating news videos with words. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 132–140. Springer, Heidelberg (2004)
Google Scholar
Jin, R., Chai, J.Y., Si, L.: Effective automatic image annotation via a coherent language model and active learning. In: Proceedings of the 12th annual ACM international conference on Multimedia, pp. 892–899 (2004)
Google Scholar
Kwaænicka, H., Paradowski, M.: Resulted word counts optimization a new approach for better automatic image annotation. Pattern Recognition 41, 3562–3571 (2008)
Article Google Scholar
Mitchell, T.M.: Machine Learning. McGraw-Hill Science Engineering (1997)
Google Scholar
Japkowicz, N., Stephen, S.: The class imbalance problem: A systematic study. Intelligent Data Analysis 6(5), 429–449 (2002)
MATH Google Scholar
Wang, L., Khan, L.: Automatic image annotation and retrieval using weighted feature selection. Multimedia Tools and Applications 29(1), 55–71 (2006)
Article Google Scholar
Yavlinsky, A., Schofield, E., Rüger, S.M.: Automated image annotation using global features and robust nonparametric density estimation. In: Leow, W.-K., Lew, M., Chua, T.-S., Ma, W.-Y., Chaisorn, L., Bakker, E.M. (eds.) CIVR 2005. LNCS, vol. 3568, pp. 507–517. Springer, Heidelberg (2005)
Google Scholar
Smeulders, A.W.M., Gupta, A.: Content-based image retrieval at the end of the early years. Transactions on Pattern Analysis and Machine Intelligence 22(12), 1349–1380 (2000)
Article Google Scholar
Kwaænicka, H., Paradowski, M.: A discussion on evaluation of image auto-annotation methods. In: Proceedings of The Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), vol. 2, pp. 353–358 (2006)
Google Scholar
Pan, J.-Y., Yang, H.-J., Faloutsos, C., Duygulu, P.: Gcap: Graph-based automatic image captioning. In: Proceedings of the 4th International Workshop on Multimedia Data and Document Engineering (MDDE 2004), in conjunction with Computer Vision Pattern Recognition Conference, CVPR 2004 (2004)
Google Scholar
Jeon, J., Manmatha, R.: Using maximum entropy for automatic image annotation. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 24–32. Springer, Heidelberg (2004)
Google Scholar
Ye, J., Zhou, X., Pei, J., Chen, L., Zhang, L.: A stratification-based approach to accurate and fast image annotation. In: Fan, W., Wu, Z., Yang, J. (eds.) WAIM 2005. LNCS, vol. 3739, pp. 284–296. Springer, Heidelberg (2005)
Chapter Google Scholar
Tang, J., Lewis, P.H.: Image auto-annotation using ‘easy’ and ‘more challenging’ training sets. In: Proceedings of 7th International Workshop on Image Analysis for Multimedia Interactive Services, pp. 121–124 (2006)
Google Scholar
Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 42–50. Springer, Heidelberg (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Wroclaw University of Technology, Wroclaw, Poland
Halina Kwaśnicka & Mariusz Paradowski

Authors

Halina Kwaśnicka
View author publications
You can also search for this author in PubMed Google Scholar
Mariusz Paradowski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Science, Polish Academy of Sciences, ul. Ordona 21, 01-237, Warsaw, Poland
Jacek Koronacki & Sławomir T. Wierzchoń &
Woodward Hall 430C, University of North Carolina, 9201 University City Blvd., N.C. 28223, Charlotte, USA
Zbigniew W. Raś
Systems Research Institute, Polish Academy of Sciences, ul.Newelska 6, 01-447, Warsaw, Poland
Janusz Kacprzyk

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kwaśnicka, H., Paradowski, M. (2010). Machine Learning Methods in Automatic Image Annotation. In: Koronacki, J., Raś, Z.W., Wierzchoń, S.T., Kacprzyk, J. (eds) Advances in Machine Learning II. Studies in Computational Intelligence, vol 263. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05179-1_18

Download citation

DOI: https://doi.org/10.1007/978-3-642-05179-1_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05178-4
Online ISBN: 978-3-642-05179-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics