Skip to main content

Machine Learning Methods in Automatic Image Annotation

  • Chapter
Advances in Machine Learning II

Part of the book series: Studies in Computational Intelligence ((SCI,volume 263))

Abstract

Machine learning methods are successfully applied in many branches of Computer Science. One of these branches is image analysis, and being more specific – Automatic Image Annotation. Automatic Image Annotation was found an important research domain several years ago. It grew from such research domains as image recognition and cross-lingual machine translation. Increase of computational, data storage and data transfer abilities of todays’ computers has been one of key factors, making Automatic Image Annotation possible. Automatic Image Annotation methods, which have appeared during last several years, make a large use of many machine learning approaches. Clustering and classification methods are most frequently applied to annotate images. The chapter consists of three main parts. In the first, some general information concerning annotation methods is presented. In the second part, two original annotation methods are described. The last part presents experimental studies of the proposed methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: Proceedings of NIPS. MIT Press, Cambridge (2003)

    Google Scholar 

  2. Lavrenko, V., Feng, S.L., Manmatha, R.: Statistical models for automatic video annotation and retrieval. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), vol. 3, pp. 1044–1047 (2004)

    Google Scholar 

  3. Jeon, J., Manmatha, R.: Automatic image annotation of news images with large vocabularies and low quality training data. MM 368, University of Massachusetts (2004)

    Google Scholar 

  4. Feng, S.L., Manmatha, R., Lavrenko, V.: Multiple bernoulli relevance models for image and video annotation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), vol. 2, pp. 1002–1009 (2004)

    Google Scholar 

  5. Laaksonen, J., Koskela, M., Oja, E.: Picsom-self-organizing image retrieval with mpeg-7 content descriptors. IEEE Transactions on Neural Networks 13(4), 841–853 (2002)

    Article  Google Scholar 

  6. Viitaniemi, V., Laaksonen, J.: Keyword-detection approach to automatic image annotation. In: Proceedings of 2nd European Workshop on the Integration of Knowledge, Semantic and Digital Media Technologies (EWIMT 2005), pp. 15–22 (2005)

    Google Scholar 

  7. Cusano, C., Ciocca, G., Schettini, R.: Image annotation using SVM. In: Proceedings of SPIE, Internet Imaging V, vol. 5304, pp. 330–338 (2003)

    Google Scholar 

  8. Chang, E., Goh, K., Sychay, G., Wu, G.: Cbsa: content-based soft annotation for multimodal image retrieval using bayes point machines. IEEE Transactions on Circuits and Systems for Video Technology 13(1), 26–38 (2003)

    Article  Google Scholar 

  9. Tang, J., Lewis, P.H.: A study of quality issues for image auto-annotation with the corel data-set. IEEE Transactions on Circuits and Systems for Video Technology 17(3), 384–389 (2007)

    Article  Google Scholar 

  10. Kwaænicka, H., Paradowski, M.: Multiple class machine learning approach for image auto-annotation problem. In: Proceedings of The Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), vol. 2, pp. 347–352 (2006)

    Google Scholar 

  11. Goh, K.-S., Chang, E.Y., Li, B.: Transactions on Knowledge and Data Engineering 17(10), 1333–1346 (2005)

    Google Scholar 

  12. Yasuhide, M., Hironobu, T., Ryuichi, O.: Image-to-word transformation based on dividing and vector quantizing images with words. In: Proceedings of the International Workshop on Multimedia Intelligent Storage and Retrieval Management (1999)

    Google Scholar 

  13. Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 119–126 (2003)

    Google Scholar 

  14. Monay, F., Gatica-Perez, D.: On image auto-annotation with latent space models. In: Proceedings of the eleventh ACM international conference on Multimedia, pp. 275–278 (2003)

    Google Scholar 

  15. Pan, J.-Y., Yang, H.-J., Duygulu, P., Faloutsos, C.: Automatic image captioning. In: Proceedings of the 2004 IEEE International Conference on Multimedia and Expo. (ICME 2004), vol. 3, pp. 1987–1990 (2004)

    Google Scholar 

  16. Liu, W., Tang, X.: Learning an image-word embedding for image auto-annotation on the nonlinear latent space. In: Proceedings of the 13th annual ACM international conference on Multimedia, pp. 451–454 (2005)

    Google Scholar 

  17. Glotin, H., Tollari, S.: Fast image auto-annotation with visual vector approximation clusters. In: Proceedings of Fourth International Workshop on Content-Based Multimedia Indexing, CBMI 2005 (2005)

    Google Scholar 

  18. Kwaænicka, H., Paradowski, M.: Fast image auto-annotation with discretized feature distance measures. Machine Graphics and Vision 15(2), 123–140 (2006)

    Google Scholar 

  19. Hollink, L., Nguyen, G., Schreiber, G., Wielemaker, J., Wielinga, B., Worring, M.: Adding spatial semantics to image annotations. In: Proceedings of the 4th International Workshop on Knowledge Markup and Semantic Annotation at ISWC 2004 (2004)

    Google Scholar 

  20. Bashir, A., Khan, L.: A framework for image annotation using semantic web. In: Proceedings of ACM SIGKDD First International Workshop on Mining for and from the Semantic Web (MSW 2004) (2004)

    Google Scholar 

  21. Srikanth, M., Varner, J., Bowden, M., Moldovan, D.: Exploiting ontologies for automatic image annotation. In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 552–558 (2005)

    Google Scholar 

  22. Hollink, L., Little, S., Hunter, J.: Evaluating the application of semantic inferencing rules to image annotation. In: Proceedings of the 3rd international conference on Knowledge capture, pp. 91–98 (2005)

    Google Scholar 

  23. Hare, J.S., Sinclair, P.A.S., Lewis, P.H., Martinez, K., Enser, P.G.B., Sandom, C.J.: Bridging the semantic gap in multimedia information retrieval top-down and bottom-up approaches. In: Proceedings of Mastering the Gap: From Information Extraction to Semantic Representation / 3rd European Semantic Web Conference (2006)

    Google Scholar 

  24. Duygulu, P., Barnard, K., de Freitas, J.F.G., Forsyth, D.: Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 97–112. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  25. Rangayyan, R.M.: Biomedical Image Analysis. Biomedical Engineering Series. CRC Press, Boca Raton (2004)

    Google Scholar 

  26. Kurzyñski, M.: Multistage diagnosis of myocardial infraction using a fuzzy relation. In: Proceesings of the 7th International Conference on Artificial Intelligence and Soft Computing, pp. 1014–1019 (2004)

    Google Scholar 

  27. Abe, S.: Support Vector Machines for Pattern Classification. Springer-Verlag London Limited, Heidelberg (2005)

    Google Scholar 

  28. Shah, B., Benton, R., Wu, Z., Raghavan, V.: Automatic and Semi-Automatic Techniques for Image Annotation, pp. 112–134 (2007)

    Google Scholar 

  29. Barnard, K., Duygulu, P., de Freitas, N., Forsyth, D.: Object recognition as machine translation - part 2: Exploiting image database clustering models (2002)

    Google Scholar 

  30. Barnard, K., Duygulu, P., Forsyth, D., de Freitas, N., Blei, D.M., Jordan, M.I.: Matching words and pictures. Journal of Machine Learning Research 3, 1107–1135 (2003)

    Article  MATH  Google Scholar 

  31. Carneiro, G., Vasconcelos, N.: Formulating semantic image annotation as a supervised learning problem. In: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 2, pp. 163–168 (2005)

    Google Scholar 

  32. Chan, A.B., Moreno, P.J., Vasconcelos, N.: Using statistics to search and annotate pictures: an evaluation of semantic image annotation and retrieval on large databases. In: Joint Statistical Meetings (2006)

    Google Scholar 

  33. Carneiro, G., Chan, A.B., Moreno, P.J., Vasconcelos, N.: Supervised learning of semantic classes for image annotation and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(3), 394–410 (2007)

    Article  Google Scholar 

  34. Buturovic, A.: Mpeg-7 color structure descriptor for visual information retrieval project vizir. In: Proceedings of the International Conference on Image Processing, vol. 1, pp. 670–673 (2001)

    Google Scholar 

  35. Tsai, C.-F., Hung, C.: Automatically annotating images with keywords a review of image annotation systems. Recent Patents on Computer Science (1), 55 (2008)

    Google Scholar 

  36. Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8), 888–905 (2000)

    Article  Google Scholar 

  37. Cheng, P.-J., Chien, L.-F.: Effective image annotation for search using multi-level semantics. International Journal on Digital Libraries 4(4), 230 (2004)

    Article  Google Scholar 

  38. Hare, J.S., Lewis, P.H.: Saliency-based models of image content and their application to auto-annotation by semantic propagation. In: Multimedia and the Semantic Web / European Semantic Web Conference (2005)

    Google Scholar 

  39. Vailaya, A., Jain, A., Zhang, H.J.: On image classification: city vs. landscape. Pattern Recognition 31(12), 1921–1935 (1998)

    Article  Google Scholar 

  40. Westerveld, T., de Vries, A.P., van Ballegooij, A., de Jong, F., Hiemstra, D.: A probabilistic multimedia retrieval model and its evaluation. EURASIP Journal on Applied Signal Processing 2, 186–198 (2003)

    Google Scholar 

  41. Xu, F., Zhang, Y.-J.: A Novel Framework for Image Categorization and Automatic Annotation, pp. 90–111 (2007)

    Google Scholar 

  42. Kang, F., Jin, R., Chai, J.Y.: Regularizing translation models for better automatic image annotation. In: Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, pp. 350–359 (2004)

    Google Scholar 

  43. Duygulu, P., Hauptmann, A.: What’s news, what’s not? associating news videos with words. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 132–140. Springer, Heidelberg (2004)

    Google Scholar 

  44. Jin, R., Chai, J.Y., Si, L.: Effective automatic image annotation via a coherent language model and active learning. In: Proceedings of the 12th annual ACM international conference on Multimedia, pp. 892–899 (2004)

    Google Scholar 

  45. Kwaænicka, H., Paradowski, M.: Resulted word counts optimization a new approach for better automatic image annotation. Pattern Recognition 41, 3562–3571 (2008)

    Article  Google Scholar 

  46. Mitchell, T.M.: Machine Learning. McGraw-Hill Science Engineering (1997)

    Google Scholar 

  47. Japkowicz, N., Stephen, S.: The class imbalance problem: A systematic study. Intelligent Data Analysis 6(5), 429–449 (2002)

    MATH  Google Scholar 

  48. Wang, L., Khan, L.: Automatic image annotation and retrieval using weighted feature selection. Multimedia Tools and Applications 29(1), 55–71 (2006)

    Article  Google Scholar 

  49. Yavlinsky, A., Schofield, E., Rüger, S.M.: Automated image annotation using global features and robust nonparametric density estimation. In: Leow, W.-K., Lew, M., Chua, T.-S., Ma, W.-Y., Chaisorn, L., Bakker, E.M. (eds.) CIVR 2005. LNCS, vol. 3568, pp. 507–517. Springer, Heidelberg (2005)

    Google Scholar 

  50. Smeulders, A.W.M., Gupta, A.: Content-based image retrieval at the end of the early years. Transactions on Pattern Analysis and Machine Intelligence 22(12), 1349–1380 (2000)

    Article  Google Scholar 

  51. Kwaænicka, H., Paradowski, M.: A discussion on evaluation of image auto-annotation methods. In: Proceedings of The Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), vol. 2, pp. 353–358 (2006)

    Google Scholar 

  52. Pan, J.-Y., Yang, H.-J., Faloutsos, C., Duygulu, P.: Gcap: Graph-based automatic image captioning. In: Proceedings of the 4th International Workshop on Multimedia Data and Document Engineering (MDDE 2004), in conjunction with Computer Vision Pattern Recognition Conference, CVPR 2004 (2004)

    Google Scholar 

  53. Jeon, J., Manmatha, R.: Using maximum entropy for automatic image annotation. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 24–32. Springer, Heidelberg (2004)

    Google Scholar 

  54. Ye, J., Zhou, X., Pei, J., Chen, L., Zhang, L.: A stratification-based approach to accurate and fast image annotation. In: Fan, W., Wu, Z., Yang, J. (eds.) WAIM 2005. LNCS, vol. 3739, pp. 284–296. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  55. Tang, J., Lewis, P.H.: Image auto-annotation using ‘easy’ and ‘more challenging’ training sets. In: Proceedings of 7th International Workshop on Image Analysis for Multimedia Interactive Services, pp. 121–124 (2006)

    Google Scholar 

  56. Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 42–50. Springer, Heidelberg (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Kwaśnicka, H., Paradowski, M. (2010). Machine Learning Methods in Automatic Image Annotation. In: Koronacki, J., Raś, Z.W., Wierzchoń, S.T., Kacprzyk, J. (eds) Advances in Machine Learning II. Studies in Computational Intelligence, vol 263. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05179-1_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-05179-1_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-05178-4

  • Online ISBN: 978-3-642-05179-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics