Abstract
Machine learning methods are successfully applied in many branches of Computer Science. One of these branches is image analysis, and being more specific – Automatic Image Annotation. Automatic Image Annotation was found an important research domain several years ago. It grew from such research domains as image recognition and cross-lingual machine translation. Increase of computational, data storage and data transfer abilities of todays’ computers has been one of key factors, making Automatic Image Annotation possible. Automatic Image Annotation methods, which have appeared during last several years, make a large use of many machine learning approaches. Clustering and classification methods are most frequently applied to annotate images. The chapter consists of three main parts. In the first, some general information concerning annotation methods is presented. In the second part, two original annotation methods are described. The last part presents experimental studies of the proposed methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: Proceedings of NIPS. MIT Press, Cambridge (2003)
Lavrenko, V., Feng, S.L., Manmatha, R.: Statistical models for automatic video annotation and retrieval. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), vol. 3, pp. 1044–1047 (2004)
Jeon, J., Manmatha, R.: Automatic image annotation of news images with large vocabularies and low quality training data. MM 368, University of Massachusetts (2004)
Feng, S.L., Manmatha, R., Lavrenko, V.: Multiple bernoulli relevance models for image and video annotation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), vol. 2, pp. 1002–1009 (2004)
Laaksonen, J., Koskela, M., Oja, E.: Picsom-self-organizing image retrieval with mpeg-7 content descriptors. IEEE Transactions on Neural Networks 13(4), 841–853 (2002)
Viitaniemi, V., Laaksonen, J.: Keyword-detection approach to automatic image annotation. In: Proceedings of 2nd European Workshop on the Integration of Knowledge, Semantic and Digital Media Technologies (EWIMT 2005), pp. 15–22 (2005)
Cusano, C., Ciocca, G., Schettini, R.: Image annotation using SVM. In: Proceedings of SPIE, Internet Imaging V, vol. 5304, pp. 330–338 (2003)
Chang, E., Goh, K., Sychay, G., Wu, G.: Cbsa: content-based soft annotation for multimodal image retrieval using bayes point machines. IEEE Transactions on Circuits and Systems for Video Technology 13(1), 26–38 (2003)
Tang, J., Lewis, P.H.: A study of quality issues for image auto-annotation with the corel data-set. IEEE Transactions on Circuits and Systems for Video Technology 17(3), 384–389 (2007)
Kwaænicka, H., Paradowski, M.: Multiple class machine learning approach for image auto-annotation problem. In: Proceedings of The Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), vol. 2, pp. 347–352 (2006)
Goh, K.-S., Chang, E.Y., Li, B.: Transactions on Knowledge and Data Engineering 17(10), 1333–1346 (2005)
Yasuhide, M., Hironobu, T., Ryuichi, O.: Image-to-word transformation based on dividing and vector quantizing images with words. In: Proceedings of the International Workshop on Multimedia Intelligent Storage and Retrieval Management (1999)
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 119–126 (2003)
Monay, F., Gatica-Perez, D.: On image auto-annotation with latent space models. In: Proceedings of the eleventh ACM international conference on Multimedia, pp. 275–278 (2003)
Pan, J.-Y., Yang, H.-J., Duygulu, P., Faloutsos, C.: Automatic image captioning. In: Proceedings of the 2004 IEEE International Conference on Multimedia and Expo. (ICME 2004), vol. 3, pp. 1987–1990 (2004)
Liu, W., Tang, X.: Learning an image-word embedding for image auto-annotation on the nonlinear latent space. In: Proceedings of the 13th annual ACM international conference on Multimedia, pp. 451–454 (2005)
Glotin, H., Tollari, S.: Fast image auto-annotation with visual vector approximation clusters. In: Proceedings of Fourth International Workshop on Content-Based Multimedia Indexing, CBMI 2005 (2005)
Kwaænicka, H., Paradowski, M.: Fast image auto-annotation with discretized feature distance measures. Machine Graphics and Vision 15(2), 123–140 (2006)
Hollink, L., Nguyen, G., Schreiber, G., Wielemaker, J., Wielinga, B., Worring, M.: Adding spatial semantics to image annotations. In: Proceedings of the 4th International Workshop on Knowledge Markup and Semantic Annotation at ISWC 2004 (2004)
Bashir, A., Khan, L.: A framework for image annotation using semantic web. In: Proceedings of ACM SIGKDD First International Workshop on Mining for and from the Semantic Web (MSW 2004) (2004)
Srikanth, M., Varner, J., Bowden, M., Moldovan, D.: Exploiting ontologies for automatic image annotation. In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 552–558 (2005)
Hollink, L., Little, S., Hunter, J.: Evaluating the application of semantic inferencing rules to image annotation. In: Proceedings of the 3rd international conference on Knowledge capture, pp. 91–98 (2005)
Hare, J.S., Sinclair, P.A.S., Lewis, P.H., Martinez, K., Enser, P.G.B., Sandom, C.J.: Bridging the semantic gap in multimedia information retrieval top-down and bottom-up approaches. In: Proceedings of Mastering the Gap: From Information Extraction to Semantic Representation / 3rd European Semantic Web Conference (2006)
Duygulu, P., Barnard, K., de Freitas, J.F.G., Forsyth, D.: Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 97–112. Springer, Heidelberg (2002)
Rangayyan, R.M.: Biomedical Image Analysis. Biomedical Engineering Series. CRC Press, Boca Raton (2004)
Kurzyñski, M.: Multistage diagnosis of myocardial infraction using a fuzzy relation. In: Proceesings of the 7th International Conference on Artificial Intelligence and Soft Computing, pp. 1014–1019 (2004)
Abe, S.: Support Vector Machines for Pattern Classification. Springer-Verlag London Limited, Heidelberg (2005)
Shah, B., Benton, R., Wu, Z., Raghavan, V.: Automatic and Semi-Automatic Techniques for Image Annotation, pp. 112–134 (2007)
Barnard, K., Duygulu, P., de Freitas, N., Forsyth, D.: Object recognition as machine translation - part 2: Exploiting image database clustering models (2002)
Barnard, K., Duygulu, P., Forsyth, D., de Freitas, N., Blei, D.M., Jordan, M.I.: Matching words and pictures. Journal of Machine Learning Research 3, 1107–1135 (2003)
Carneiro, G., Vasconcelos, N.: Formulating semantic image annotation as a supervised learning problem. In: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 2, pp. 163–168 (2005)
Chan, A.B., Moreno, P.J., Vasconcelos, N.: Using statistics to search and annotate pictures: an evaluation of semantic image annotation and retrieval on large databases. In: Joint Statistical Meetings (2006)
Carneiro, G., Chan, A.B., Moreno, P.J., Vasconcelos, N.: Supervised learning of semantic classes for image annotation and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(3), 394–410 (2007)
Buturovic, A.: Mpeg-7 color structure descriptor for visual information retrieval project vizir. In: Proceedings of the International Conference on Image Processing, vol. 1, pp. 670–673 (2001)
Tsai, C.-F., Hung, C.: Automatically annotating images with keywords a review of image annotation systems. Recent Patents on Computer Science (1), 55 (2008)
Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8), 888–905 (2000)
Cheng, P.-J., Chien, L.-F.: Effective image annotation for search using multi-level semantics. International Journal on Digital Libraries 4(4), 230 (2004)
Hare, J.S., Lewis, P.H.: Saliency-based models of image content and their application to auto-annotation by semantic propagation. In: Multimedia and the Semantic Web / European Semantic Web Conference (2005)
Vailaya, A., Jain, A., Zhang, H.J.: On image classification: city vs. landscape. Pattern Recognition 31(12), 1921–1935 (1998)
Westerveld, T., de Vries, A.P., van Ballegooij, A., de Jong, F., Hiemstra, D.: A probabilistic multimedia retrieval model and its evaluation. EURASIP Journal on Applied Signal Processing 2, 186–198 (2003)
Xu, F., Zhang, Y.-J.: A Novel Framework for Image Categorization and Automatic Annotation, pp. 90–111 (2007)
Kang, F., Jin, R., Chai, J.Y.: Regularizing translation models for better automatic image annotation. In: Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, pp. 350–359 (2004)
Duygulu, P., Hauptmann, A.: What’s news, what’s not? associating news videos with words. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 132–140. Springer, Heidelberg (2004)
Jin, R., Chai, J.Y., Si, L.: Effective automatic image annotation via a coherent language model and active learning. In: Proceedings of the 12th annual ACM international conference on Multimedia, pp. 892–899 (2004)
Kwaænicka, H., Paradowski, M.: Resulted word counts optimization a new approach for better automatic image annotation. Pattern Recognition 41, 3562–3571 (2008)
Mitchell, T.M.: Machine Learning. McGraw-Hill Science Engineering (1997)
Japkowicz, N., Stephen, S.: The class imbalance problem: A systematic study. Intelligent Data Analysis 6(5), 429–449 (2002)
Wang, L., Khan, L.: Automatic image annotation and retrieval using weighted feature selection. Multimedia Tools and Applications 29(1), 55–71 (2006)
Yavlinsky, A., Schofield, E., Rüger, S.M.: Automated image annotation using global features and robust nonparametric density estimation. In: Leow, W.-K., Lew, M., Chua, T.-S., Ma, W.-Y., Chaisorn, L., Bakker, E.M. (eds.) CIVR 2005. LNCS, vol. 3568, pp. 507–517. Springer, Heidelberg (2005)
Smeulders, A.W.M., Gupta, A.: Content-based image retrieval at the end of the early years. Transactions on Pattern Analysis and Machine Intelligence 22(12), 1349–1380 (2000)
Kwaænicka, H., Paradowski, M.: A discussion on evaluation of image auto-annotation methods. In: Proceedings of The Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), vol. 2, pp. 353–358 (2006)
Pan, J.-Y., Yang, H.-J., Faloutsos, C., Duygulu, P.: Gcap: Graph-based automatic image captioning. In: Proceedings of the 4th International Workshop on Multimedia Data and Document Engineering (MDDE 2004), in conjunction with Computer Vision Pattern Recognition Conference, CVPR 2004 (2004)
Jeon, J., Manmatha, R.: Using maximum entropy for automatic image annotation. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 24–32. Springer, Heidelberg (2004)
Ye, J., Zhou, X., Pei, J., Chen, L., Zhang, L.: A stratification-based approach to accurate and fast image annotation. In: Fan, W., Wu, Z., Yang, J. (eds.) WAIM 2005. LNCS, vol. 3739, pp. 284–296. Springer, Heidelberg (2005)
Tang, J., Lewis, P.H.: Image auto-annotation using ‘easy’ and ‘more challenging’ training sets. In: Proceedings of 7th International Workshop on Image Analysis for Multimedia Interactive Services, pp. 121–124 (2006)
Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 42–50. Springer, Heidelberg (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kwaśnicka, H., Paradowski, M. (2010). Machine Learning Methods in Automatic Image Annotation. In: Koronacki, J., Raś, Z.W., Wierzchoń, S.T., Kacprzyk, J. (eds) Advances in Machine Learning II. Studies in Computational Intelligence, vol 263. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05179-1_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-05179-1_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05178-4
Online ISBN: 978-3-642-05179-1
eBook Packages: EngineeringEngineering (R0)