Skip to main content

Automatic Image Annotation Using Multiple Grid Segmentation

  • Conference paper
Advances in Artificial Intelligence (MICAI 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6437))

Included in the following conference series:

Abstract

Automatic image annotation refers to the process of automatically labeling an image with a predefined set of keywords. Image annotation is an important step of content-based image retrieval (CBIR), which is relevant for many real-world applications. In this paper, a new algorithm based on multiple grid segmentation, entropy-based information and a Bayesian classifier, is proposed for an efficient, yet very effective, image annotation process. The proposed approach follows a two step process. In the first step, the algorithm generates grids of different sizes and different overlaps, and each grid is classified with a Naive Bayes classifier. In a second step, we used information based on the predicted class probability, its entropy, and the entropy of the neighbors of each grid element at the same and different resolutions, as input to a second binary classifier that qualifies the initial classification to select the correct segments. This significantly reduces false positives and improves the overall performance. We performed several experiments with images from the MSRC-9 database collection, which has manual ground truth segmentation and annotation information. The results show that the proposed approach has a very good performance compared to the initial labeling, and it also improves other scheme based on multiple segmentations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Viola, P., Jones, M.: Robust real-time face detection. Int. J. of Comp. Vision (2001)

    Google Scholar 

  2. Malisiewicz, T., Efros, A.A.: Improving spatial support for objects via multiple segmentations. In: BMVC (2007)

    Google Scholar 

  3. Pantofaru, C., Schmid, C.: Object recognition by integrating multiple image segmentations. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 481–494. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  4. Shi, J., Malik, J.: Normalized cuts and image segmentation. In: Proc. CVPR, pp. 731–743 (1997)

    Google Scholar 

  5. Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis pami. IEEE Trans. Patt. Anal. Mach. Intell. 24, 603–619 (2002)

    Article  Google Scholar 

  6. Felzenszwalb, P., Huttenlocher, D.: Efficient graph-based image segmentation. Int. Journal of Computer Vision 59, 167–181 (2004)

    Article  Google Scholar 

  7. Shotton, J., Winn, J., Rother, C., Criminisi, A.: The msrc 21-class object recognition database (2006)

    Google Scholar 

  8. Everingham, M., Van Gool, L., Williams, C., Winn, J., Zisserman, A.: The pascal voc 2007 (2007)

    Google Scholar 

  9. Carbonetto, P.: Unsupervised statistical models for general object recognition. Master’s thesis, The University of British Columbia (2003)

    Google Scholar 

  10. Aksoy, S., Haralick, R.: Textural features for image database retrieval. In: CBAIVL 1998, p. 45. IEEE Computer Society, Los Alamitos (1998)

    Google Scholar 

  11. Chen, L., Lu, G., Zhang, D.: Content-based image retrieval using gabor texture features. In: PCM 2000, Sydney, Australia, pp. 1139–1142 (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Arellano, G., Sucar, L.E., Morales, E.F. (2010). Automatic Image Annotation Using Multiple Grid Segmentation. In: Sidorov, G., Hernández Aguirre, A., Reyes García, C.A. (eds) Advances in Artificial Intelligence. MICAI 2010. Lecture Notes in Computer Science(), vol 6437. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16761-4_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16761-4_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16760-7

  • Online ISBN: 978-3-642-16761-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics