Abstract
Automatic image annotation is the process of automatically labeling an image with keywords from a predefined set. It is an important step in content-based image retrieval (CBIR), which is relevant to many real-world applications. In this paper, we propose a new algorithm for efficient and effective image annotation, based on multiple grid segmentation, entropy-based information, and a Bayesian classifier. The proposed approach follows a two-step process. In the first step, the algorithm generates grids of different sizes and with different overlaps, and each grid element is classified with a Naive Bayes classifier. In the second step, information based on the predicted class probability, its entropy, and the entropy of the neighbors of each grid element at the same and at different resolutions is used as input to a second, binary classifier that qualifies the initial classification in order to select the correct segments. This significantly reduces false positives and improves overall performance. We performed several experiments with images from the MSRC-9 database collection, which provides manual ground-truth segmentation and annotation information. The results show that the proposed approach performs very well compared to the initial labeling, and that it also improves on another scheme based on multiple segmentations.
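The second step can be illustrated with a minimal sketch of how per-element features might be assembled for a single grid. The abstract does not specify the exact features, so everything here is an assumption: the function names (`cell_entropy`, `neighbor_mean_entropy`, `second_stage_features`) are hypothetical, entropy is measured in bits, neighbors are taken to be the 4-connected elements of the same grid, and neighbors at other resolutions are left out.

```python
import numpy as np


def cell_entropy(probs):
    """Shannon entropy (bits) of each grid element's class distribution.

    probs has shape (rows, cols, n_classes); each element's
    probabilities are assumed to sum to 1.
    """
    p = np.clip(probs, 1e-12, 1.0)  # avoid log2(0)
    return -(p * np.log2(p)).sum(axis=-1)


def neighbor_mean_entropy(ent):
    """Mean entropy over the 4-connected neighbors of each element.

    Assumption: only neighbors on the same grid are used; border
    elements average over the neighbors that exist.
    """
    padded = np.pad(ent, 1, constant_values=0.0)
    exists = np.pad(np.ones_like(ent), 1, constant_values=0.0)
    total = (padded[:-2, 1:-1] + padded[2:, 1:-1]
             + padded[1:-1, :-2] + padded[1:-1, 2:])
    count = (exists[:-2, 1:-1] + exists[2:, 1:-1]
             + exists[1:-1, :-2] + exists[1:-1, 2:])
    return total / np.maximum(count, 1.0)  # a 1x1 grid has no neighbors


def second_stage_features(probs):
    """Stack hypothetical features for the second, binary classifier:
    [top class probability, own entropy, mean neighbor entropy]."""
    ent = cell_entropy(probs)
    return np.stack(
        [probs.max(axis=-1), ent, neighbor_mean_entropy(ent)], axis=-1
    )
```

Given the class posteriors produced by the first-stage classifier for one grid, `second_stage_features(probs)` returns one feature vector per grid element. A low top probability combined with high entropy marks an uncertain element, which is the kind of evidence a second classifier could use to reject false positives.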
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Arellano, G., Sucar, L.E., Morales, E.F. (2010). Automatic Image Annotation Using Multiple Grid Segmentation. In: Sidorov, G., Hernández Aguirre, A., Reyes García, C.A. (eds) Advances in Artificial Intelligence. MICAI 2010. Lecture Notes in Computer Science, vol 6437. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16761-4_25
DOI: https://doi.org/10.1007/978-3-642-16761-4_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16760-7
Online ISBN: 978-3-642-16761-4
eBook Packages: Computer Science, Computer Science (R0)