Abstract
In this paper, we present an image tagging framework based on multiple feature tag relevance learning (MFTRL). First, in specific feature space, each training image is encoded as a sparse linear combination of other training images by ℓ1 minimization, component images are treated as the nearest neighbors of the target image, so we can get each image’s ℓ1 nearest-neighbor by the ℓ1 norm cost function. Then, maximum a posteriori (MAP) principle is utilized to determine the tag relevance for the testing image in specific feature space. Finally, the output of many tag relevance by diverse features can be combined in the manner of combining multi-feature tag relevance. The experiments over the well known data set demonstrate that the proposed method is beneficial and outperforms most existing image tagging algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Carneiro, G., Chan, A., Moreno, P.: Supervised Learning of Semantic Classes for Image Annotation and Retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 394–410 (2007)
Lavrenko, V.: A Model for Learning the Semantics of Pictures. In: Proc. NIPS, pp. 125–129 (2003)
Feng, S., Manmatha, R., Lavrenko, V.: Multiple bernoulli relevance models for image and video annotation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1002–1009 (2004)
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic Image Annotation and Retrieval Using Cross-media Relevance Models. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 119–126 (2003)
Field, D.: What is the Goal of Sensory Coding? Neural Computation, 559–601 (1994)
Wright, J.: Robust Face Recognition via Sparse Representation. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 210–227 (2008)
Donoho, D.: For most large underdetermined systems of linear equation the minimall L1-norm solution is also the sparsest solution. Comm. on Pure and Applied Math. 59(6), 797–829 (2006)
Freund, Y.: An efficient boosting algorithm for combining preferences. J. Mach. Learn. Res., 933–969 (2003)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer (2001)
Aslam, J., Montague, M.: Models for meta search. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 276–284 (2001)
Hare, J.S.: Automatically Annotating the MIR Flickr Dataset. In: ACM SIGMM International Conference on Multimedia Information Retrieval, pp. 547–556 (2010)
Guillaumin, M.: Tagprop: Discriminative metric learning in nearest neighbor models for image auto-tagging. In: International Conference on Computer Vision, pp. 309–316 (2009)
Metzler, D., Manmatha, R.: An inference network approach to image. Image Feature Extraction. Indexing and Retrieval In Image and Video Retrieval, 42–50 (2004)
Yavlinsky, A.: Automated image tagging using global features and robust nonparametric density estimation. Image Feature Extraction, Indexing and Retrieval In Image and Video Retrieval, 507–517 (2005)
Carneiros, G.: Supervised learning of semantic classes for image tagging and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(3), 394–410 (2007)
Liu, J., Li, M., Liu, Q., Lu, H., Ma, S.: Image tagging via graph learning. Pattern Recogn. 42(2), 218–228 (2009)
Makadia, A., Pavlovic, V., Kumar, S.: A New Baseline for Image Annotation. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 316–329. Springer, Heidelberg (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tian, F., Shen, XK., Shang, FH., Zhou, K. (2012). Automatic Image Tagging by Multiple Feature Tag Relevance Learning. In: Liu, CL., Zhang, C., Wang, L. (eds) Pattern Recognition. CCPR 2012. Communications in Computer and Information Science, vol 321. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33506-8_62
Download citation
DOI: https://doi.org/10.1007/978-3-642-33506-8_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33505-1
Online ISBN: 978-3-642-33506-8
eBook Packages: Computer ScienceComputer Science (R0)