Automatic Image Tagging by Multiple Feature Tag Relevance Learning

Tian, Feng; Shen, Xu-Kun; Shang, Fu-Hua; Zhou, Kai

doi:10.1007/978-3-642-33506-8_62

Feng Tian^4,5,
Xu-Kun Shen⁴,
Fu-Hua Shang⁵ &
…
Kai Zhou⁵

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 321))

Included in the following conference series:

Chinese Conference on Pattern Recognition

3333 Accesses

Abstract

In this paper, we present an image tagging framework based on multiple feature tag relevance learning (MFTRL). First, in specific feature space, each training image is encoded as a sparse linear combination of other training images by ℓ¹ minimization, component images are treated as the nearest neighbors of the target image, so we can get each image’s ℓ¹ nearest-neighbor by the ℓ¹ norm cost function. Then, maximum a posteriori (MAP) principle is utilized to determine the tag relevance for the testing image in specific feature space. Finally, the output of many tag relevance by diverse features can be combined in the manner of combining multi-feature tag relevance. The experiments over the well known data set demonstrate that the proposed method is beneficial and outperforms most existing image tagging algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Carneiro, G., Chan, A., Moreno, P.: Supervised Learning of Semantic Classes for Image Annotation and Retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 394–410 (2007)
Google Scholar
Lavrenko, V.: A Model for Learning the Semantics of Pictures. In: Proc. NIPS, pp. 125–129 (2003)
Google Scholar
Feng, S., Manmatha, R., Lavrenko, V.: Multiple bernoulli relevance models for image and video annotation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1002–1009 (2004)
Google Scholar
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic Image Annotation and Retrieval Using Cross-media Relevance Models. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 119–126 (2003)
Google Scholar
Field, D.: What is the Goal of Sensory Coding? Neural Computation, 559–601 (1994)
Google Scholar
Wright, J.: Robust Face Recognition via Sparse Representation. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 210–227 (2008)
Google Scholar
Donoho, D.: For most large underdetermined systems of linear equation the minimall L1-norm solution is also the sparsest solution. Comm. on Pure and Applied Math. 59(6), 797–829 (2006)
Article MathSciNet MATH Google Scholar
Freund, Y.: An efficient boosting algorithm for combining preferences. J. Mach. Learn. Res., 933–969 (2003)
Google Scholar
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer (2001)
Google Scholar
Aslam, J., Montague, M.: Models for meta search. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 276–284 (2001)
Google Scholar
Hare, J.S.: Automatically Annotating the MIR Flickr Dataset. In: ACM SIGMM International Conference on Multimedia Information Retrieval, pp. 547–556 (2010)
Google Scholar
Guillaumin, M.: Tagprop: Discriminative metric learning in nearest neighbor models for image auto-tagging. In: International Conference on Computer Vision, pp. 309–316 (2009)
Google Scholar
Metzler, D., Manmatha, R.: An inference network approach to image. Image Feature Extraction. Indexing and Retrieval In Image and Video Retrieval, 42–50 (2004)
Google Scholar
Yavlinsky, A.: Automated image tagging using global features and robust nonparametric density estimation. Image Feature Extraction, Indexing and Retrieval In Image and Video Retrieval, 507–517 (2005)
Google Scholar
Carneiros, G.: Supervised learning of semantic classes for image tagging and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(3), 394–410 (2007)
Article Google Scholar
Liu, J., Li, M., Liu, Q., Lu, H., Ma, S.: Image tagging via graph learning. Pattern Recogn. 42(2), 218–228 (2009)
Article MATH Google Scholar
Makadia, A., Pavlovic, V., Kumar, S.: A New Baseline for Image Annotation. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 316–329. Springer, Heidelberg (2008)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

The State Key Laboratory of Virtual Reality Technology and Systems, BeiHang University, Beijing, 100191, China
Feng Tian & Xu-Kun Shen
School of Computer and Information Technology, Northeast Petroleum University, DaQing, 163318, China
Feng Tian, Fu-Hua Shang & Kai Zhou

Authors

Feng Tian
View author publications
You can also search for this author in PubMed Google Scholar
Xu-Kun Shen
View author publications
You can also search for this author in PubMed Google Scholar
Fu-Hua Shang
View author publications
You can also search for this author in PubMed Google Scholar
Kai Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, No.95, Zhongguancun East Road, 100190, Beijing, China
Cheng-Lin Liu
Department of Automation, Tsinghua University, Haidian District, 100084, Beijing, China
Changshui Zhang
Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, 100190, Beijing, China
Liang Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tian, F., Shen, XK., Shang, FH., Zhou, K. (2012). Automatic Image Tagging by Multiple Feature Tag Relevance Learning. In: Liu, CL., Zhang, C., Wang, L. (eds) Pattern Recognition. CCPR 2012. Communications in Computer and Information Science, vol 321. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33506-8_62

Download citation

DOI: https://doi.org/10.1007/978-3-642-33506-8_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33505-1
Online ISBN: 978-3-642-33506-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics