Automatic Image Annotation Using Multiple Grid Segmentation

Arellano, Gerardo; Sucar, Luis Enrique; Morales, Eduardo F.

doi:10.1007/978-3-642-16761-4_25

Gerardo Arellano²²,
Luis Enrique Sucar²² &
Eduardo F. Morales²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6437))

Included in the following conference series:

Mexican International Conference on Artificial Intelligence

1337 Accesses
1 Citations

Abstract

Automatic image annotation refers to the process of automatically labeling an image with a predefined set of keywords. Image annotation is an important step of content-based image retrieval (CBIR), which is relevant for many real-world applications. In this paper, a new algorithm based on multiple grid segmentation, entropy-based information and a Bayesian classifier, is proposed for an efficient, yet very effective, image annotation process. The proposed approach follows a two step process. In the first step, the algorithm generates grids of different sizes and different overlaps, and each grid is classified with a Naive Bayes classifier. In a second step, we used information based on the predicted class probability, its entropy, and the entropy of the neighbors of each grid element at the same and different resolutions, as input to a second binary classifier that qualifies the initial classification to select the correct segments. This significantly reduces false positives and improves the overall performance. We performed several experiments with images from the MSRC-9 database collection, which has manual ground truth segmentation and annotation information. The results show that the proposed approach has a very good performance compared to the initial labeling, and it also improves other scheme based on multiple segmentations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Viola, P., Jones, M.: Robust real-time face detection. Int. J. of Comp. Vision (2001)
Google Scholar
Malisiewicz, T., Efros, A.A.: Improving spatial support for objects via multiple segmentations. In: BMVC (2007)
Google Scholar
Pantofaru, C., Schmid, C.: Object recognition by integrating multiple image segmentations. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 481–494. Springer, Heidelberg (2008)
Chapter Google Scholar
Shi, J., Malik, J.: Normalized cuts and image segmentation. In: Proc. CVPR, pp. 731–743 (1997)
Google Scholar
Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis pami. IEEE Trans. Patt. Anal. Mach. Intell. 24, 603–619 (2002)
Article Google Scholar
Felzenszwalb, P., Huttenlocher, D.: Efficient graph-based image segmentation. Int. Journal of Computer Vision 59, 167–181 (2004)
Article Google Scholar
Shotton, J., Winn, J., Rother, C., Criminisi, A.: The msrc 21-class object recognition database (2006)
Google Scholar
Everingham, M., Van Gool, L., Williams, C., Winn, J., Zisserman, A.: The pascal voc 2007 (2007)
Google Scholar
Carbonetto, P.: Unsupervised statistical models for general object recognition. Master’s thesis, The University of British Columbia (2003)
Google Scholar
Aksoy, S., Haralick, R.: Textural features for image database retrieval. In: CBAIVL 1998, p. 45. IEEE Computer Society, Los Alamitos (1998)
Google Scholar
Chen, L., Lu, G., Zhang, D.: Content-based image retrieval using gabor texture features. In: PCM 2000, Sydney, Australia, pp. 1139–1142 (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Department, Instituto Nacional de Astrofísica, Óptica y Electrónica, Luis Enrique Erro 1, Tonantzintla, Puebla, México
Gerardo Arellano, Luis Enrique Sucar & Eduardo F. Morales

Authors

Gerardo Arellano
View author publications
You can also search for this author in PubMed Google Scholar
Luis Enrique Sucar
View author publications
You can also search for this author in PubMed Google Scholar
Eduardo F. Morales
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Instituto Politécnico Nacional, Centro de Investigación en Computación, Av. Juan Dios Batiz, s/n, Zacatenco, 07738, Mexico City, México
Grigori Sidorov
Area de Computación, Centro de Investigación en Matemáticas (CIMAT), Callejón de Jalisco s/n, Mineral de Valenciana, 36240, Guanajuato, México
Arturo Hernández Aguirre
Instituto Nacional de Astrofísica, Optica y Electrónica (INAOE), Ciencias Computacionales, Luis Enrique Erro No. 1, 72840, Santa María Tonantzintla, Puebla,, México
Carlos Alberto Reyes García

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Arellano, G., Sucar, L.E., Morales, E.F. (2010). Automatic Image Annotation Using Multiple Grid Segmentation. In: Sidorov, G., Hernández Aguirre, A., Reyes García, C.A. (eds) Advances in Artificial Intelligence. MICAI 2010. Lecture Notes in Computer Science(), vol 6437. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16761-4_25

Download citation

DOI: https://doi.org/10.1007/978-3-642-16761-4_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16760-7
Online ISBN: 978-3-642-16761-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics