Interest Point and Segmentation-Based Photo Annotation

Daróczy, Bálint; Petrás, István; Benczúr, András A.; Fekete, Zsolt; Nemeskey, Dávid; Siklósi, Dávid; Weiner, Zsuzsa

doi:10.1007/978-3-642-15751-6_44

Bálint Daróczy²³,
István Petrás²³,
András A. Benczúr²³,
Zsolt Fekete²³,
Dávid Nemeskey²³,
Dávid Siklósi²³ &
…
Zsuzsa Weiner²³

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6242))

Included in the following conference series:

Workshop of the Cross-Language Evaluation Forum for European Languages

486 Accesses
1 Citations

Abstract

Our approach to the ImageCLEF 2009 tasks is based on image segmentation, SIFT keypoints and Okapi BM25-based text retrieval. We use feature vectors to describe the visual content of an image segment, a keypoint or the entire image. The features include color histograms, a shape descriptor as well as a 2D Fourier transform of a segment and an orientation histogram of detected keypoints. We trained a Gaussian Mixture Model (GMM) to cluster the feature vectors extracted from the image segments and keypoints independently. The normalized Fisher gradient vector computed from GMM of SIFT descriptors is a well known technique to represent an image with only one vector. Novel to our method is the combination of Fisher vectors for keypoints with those of the image segments to improve classification accuracy. We introduced correlation-based combining methods to further improve classification quality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ah-Pine, J., Cifarelli, C., Clinchant, S., Csurka, G., Renders, J.: Xrce’s participation to imageclef 2008. In: Peters, C., Deselaers, T., Ferro, N., Gonzalo, J., Jones, G.J.F., Kurimo, M., Mandl, T., Peñas, A., Petras, V. (eds.) Evaluating Systems for Multilingual and Multimodal Information Access. LNCS, vol. 5706. Springer, Heidelberg (2009)
Chapter Google Scholar
Daroczy, B., et al.: Sztaki@imageclef 2009. In: Working Notes for the CLEF 2009 Workshop, Corfu, Greece (2009)
Google Scholar
Benczúr, A.A., Csalogány, K., Friedman, E., Fogaras, D., Sarlós, T., Uher, M., Windhager, E.: Searching a small national domain—preliminary report. In: Proceedings of the 12th World Wide Web Conference (WWW), Budapest, Hungary (2003), http://datamining.sztaki.hu/?q=en/en-publications
Büttcher, S., Clarke, C.L.A., Lushman, B.: Term proximity scoring for ad-hoc retrieval on very large text collections. In: SIGIR 2006, pp. 621–622. ACM Press, New York (2006)
Chapter Google Scholar
Carson, C., Belongie, S., Greenspan, H., Malik, J.: Blobworld: Image segmentation using expectation-maximization and its application to image querying. IEEE Trans. Pattern Anal. Mach. Intell. 24(8), 1026–1038 (2002)
Article Google Scholar
Chen, Y., Wang, J.Z.: Image categorization by learning and reasoning with regions. J. Mach. Learn. Res. 5, 913–939 (2004)
Google Scholar
Daróczy, B., Fekete, Z., Brendel, M., Rácz, S., Benczúr, A., Siklósi, D., Pereszlényi, A.: Cross-modal image retrieval with parameter tuning. In: Peters, C., Deselaers, T., Ferro, N., Gonzalo, J., Jones, G.J.F., Kurimo, M., Mandl, T., Peñas, A., Petras, V. (eds.) CLEF 2008. LNCS, vol. 5706. Springer, Heidelberg (2009)
Google Scholar
Fan, R., Chang, K., Hsieh, C., Wang, X., Lin, C.: LIBLINEAR: A library for large linear classication. The Journal of Machine Learning Research 9, 1871–1874 (2008)
Google Scholar
Lowe, D.: Object recognition from local scale-invariant features. In: International Conference on Computer Vision, vol. 2, pp. 1150–1157 (1999)
Google Scholar
Lv, Q., Charikar, M., Li, K.: Image similarity search with compact data structures. In: CIKM 2004: Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, pp. 208–217. ACM Press, New York (2004)
Chapter Google Scholar
Nowak, S., Dunker, P.: Overview of the CLEF 2009 large scale visual concept detection and annotation task. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 94–109. Springer, Heidelberg (2010)
Google Scholar
Paramita, M., Sanderson, M., Clough, P.: Diversity in photo retrieval: overview of the ImageCLEFPhoto task 2009. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 45–59. Springer, Heidelberg (2010)
Google Scholar
Perronnin, F., Dance, C.: Fisher kernels on visual vocabularies for image categorization. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8 (2007)
Google Scholar
Prasad, B.G., Biswas, K.K., Gupta, S.K.: Region-based image retrieval using integrated color, shape, and location index. Comput. Vis. Image Underst. 94(1-3), 193–233 (2004)
Article Google Scholar
Rasolofo, Y., Savoy, J.: Term proximity scoring for keyword-based retrieval systems. In: Sebastiani, F. (ed.) ECIR 2003. LNCS, vol. 2633, pp. 207–218. Springer, Heidelberg (2003)
Chapter Google Scholar
Robertson, S.E., Jones, K.S.: Relevance weighting of search terms. In: Document retrieval systems, pp. 143–160. Taylor Graham Publishing, London (1988)
Google Scholar
Tsikrika, T., Kludas, J.: Overview of the WikipediaMM task at ImageCLEF 2009. In: Working Notes for the CLEF 2009 Workshop, Corfu, Greece (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Data Mining and Web search Research Group, Informatics Laboratory, Computer and Automation Research Institute, of the Hungarian Academy of Sciences,
Bálint Daróczy, István Petrás, András A. Benczúr, Zsolt Fekete, Dávid Nemeskey, Dávid Siklósi & Zsuzsa Weiner

Authors

Bálint Daróczy
View author publications
You can also search for this author in PubMed Google Scholar
István Petrás
View author publications
You can also search for this author in PubMed Google Scholar
András A. Benczúr
View author publications
You can also search for this author in PubMed Google Scholar
Zsolt Fekete
View author publications
You can also search for this author in PubMed Google Scholar
Dávid Nemeskey
View author publications
You can also search for this author in PubMed Google Scholar
Dávid Siklósi
View author publications
You can also search for this author in PubMed Google Scholar
Zsuzsa Weiner
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ISTI-CNR, Area Ricerca CNR, Via Moruzzi, 1, 56124, Pisa, Italy
Carol Peters
Idiap Research Institute, Rue Marconi 19, 1920, Martigny, Switzerland
Barbara Caputo
LSI-UNED, Juan del Rosal, 16, 28040, Madrid, Spain
Julio Gonzalo
Centre for Digital Video Processing, School of Computing, Dublin City University, Dublin 9, Ireland
Gareth J. F. Jones
Oregon Health and Science University, 3181 SW Sam Jackson Park Road, 97239-3098, Portland, OR, USA
Jayashree Kalpathy-Cramer
University of Applied Sciences Western Switzerland, TechnoArk 3, 3960, Sierre, Switzerland
Henning Müller
Centrum Wiskunde and Infoormatica, Science Park 123, 1098, Amsterdam, XG, The Netherlands
Theodora Tsikrika

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Daróczy, B. et al. (2010). Interest Point and Segmentation-Based Photo Annotation. In: Peters, C., et al. Multilingual Information Access Evaluation II. Multimedia Experiments. CLEF 2009. Lecture Notes in Computer Science, vol 6242. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15751-6_44

Download citation

DOI: https://doi.org/10.1007/978-3-642-15751-6_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15750-9
Online ISBN: 978-3-642-15751-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics