A Study into Annotation Ranking Metrics in Community Contributed Image Corpora

Hughes, Mark; Jones, Gareth J. F.; O’Connor, Noel E.

doi:10.1007/978-3-319-12093-5_8


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8382)

Included in the following conference series:

International Workshop on Adaptive Multimedia Retrieval


Abstract

Community contributed datasets are becoming increasingly common in automated image annotation systems. One important issue with community image data is that there is no guarantee that the associated metadata is relevant. A method is required that can accurately rank the semantic relevance of community annotations, which should enable the extraction of relevant subsets from potentially noisy collections of these annotations. Having relevant, non-heterogeneous tags assigned to images should improve community image retrieval systems, such as Flickr, which are based on text retrieval methods. In the literature, the current state-of-the-art approach to ranking the semantic relevance of Flickr tags is based on the widely used tf-idf metric. In the case of datasets containing landmark images, however, this metric is inefficient and can be improved upon. In this paper, we present a landmark recognition framework that provides end-to-end automated recognition and annotation. In our study into automated annotation, we evaluate five alternative approaches to tf-idf for ranking tag relevance in community contributed landmark image corpora. We carry out a thorough evaluation of each of these ranking metrics, and the results demonstrate that four of the proposed techniques outperform the commonly used tf-idf approach for this task. Our best performing approach achieves a significant F-Measure increase of 0.19 over tf-idf.
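As a rough illustration of the tf-idf baseline mentioned in the abstract, the sketch below ranks the tags attached to one group of images against a small corpus of tag lists. It is a minimal example under stated assumptions, not the formulation used in the paper: the function name `rank_tags_tfidf`, the choice to treat each tag list as one "document", the length-normalised term frequency, and the toy data are all hypothetical.

```python
import math
from collections import Counter


def rank_tags_tfidf(target, corpus):
    """Rank the tags in `target` by tf-idf against `corpus` (a list of tag lists).

    Assumption: each tag list plays the role of one document.
    """
    n_docs = len(corpus)
    # Document frequency: number of tag lists containing each tag at least once.
    df = Counter(tag for doc in corpus for tag in set(doc))
    tf = Counter(target)
    scores = {}
    for tag, count in tf.items():
        # Tags absent from the corpus are given df = 1 (an assumption)
        # so that the division below is always defined.
        idf = math.log(n_docs / max(df[tag], 1))
        scores[tag] = (count / len(target)) * idf
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical toy corpus: one tag list per landmark image group.
corpus = [
    ["eiffel", "paris", "tower", "holiday", "eiffel"],
    ["louvre", "paris", "museum", "holiday"],
    ["notredame", "paris", "cathedral", "holiday"],
]
ranking = rank_tags_tfidf(corpus[0], corpus)
for tag, score in ranking:
    print(f"{tag}: {score:.3f}")
```

In this toy setting, tags that occur in every tag list (such as "paris" and "holiday") receive an idf of zero and fall to the bottom of the ranking, while tags specific to one group rise to the top.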


Notes

1. Flickr: www.flickr.com


Author information

Authors and Affiliations

CLARITY: Centre for Sensor Web Technologies, Dublin City University, Dublin 9, Ireland
Mark Hughes & Noel E. O’Connor
Centre for Next Generation Localisation, Dublin City University, Dublin 9, Ireland
Gareth J. F. Jones


Corresponding author

Correspondence to Noel E. O’Connor.

Editor information

Editors and Affiliations

Otto-von-Guericke-Universität Magdeburg, Magdeburg, Germany
Andreas Nürnberger
Otto-von-Guericke-Universität Magdeburg, Magdeburg, Germany
Sebastian Stober
Royal School of Library and Information Science, Copenhagen, Denmark
Birger Larsen
Université Pierre et Marie Curie, Paris, France
Marcin Detyniecki


About this paper

Cite this paper

Hughes, M., Jones, G.J.F., O’Connor, N.E. (2014). A Study into Annotation Ranking Metrics in Community Contributed Image Corpora. In: Nürnberger, A., Stober, S., Larsen, B., Detyniecki, M. (eds) Adaptive Multimedia Retrieval: Semantics, Context, and Adaptation. AMR 2012. Lecture Notes in Computer Science, vol 8382. Springer, Cham. https://doi.org/10.1007/978-3-319-12093-5_8


DOI: https://doi.org/10.1007/978-3-319-12093-5_8
Published: 29 October 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12092-8
Online ISBN: 978-3-319-12093-5
eBook Packages: Computer Science, Computer Science (R0)
