On Inferring Image Label Information Using Rank Minimization for Supervised Concept Embedding

  • Dmitriy Bespalov
  • Anders Lindbjerg Dahl
  • Bing Bai
  • Ali Shokoufandeh
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6688)

Abstract

Concept-based representation — combined with some classifier (e.g., support vector machine) or regression analysis (e.g., linear regression) — induces a popular approach among image processing community, used to infer image labels. We propose a supervised learning procedure to obtain an embedding to a latent concept space with the pre-defined inner product. This learning procedure uses rank minimization of the sought inner product matrix, defined in the original concept space, to find an embedding to a new low dimensional space. The empirical evidence show that the proposed supervised learning method can be used in combination with another computational image embedding procedure, such as bag-of-features method, to significantly improve accuracy of label inference, while producing embedding of low complexity.

Keywords

Independent Component Analysis Meat Sample Image Annotation Concept Space Latent Semantic Indexing 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Bai, B., Weston, J., Collobert, R., Grangier, D., Sadamasa, K., Qi, Y., Chapelle, O., Weinberger, K.: Supervised semantic indexing. In: Proceeding of the 18th ACM Conference on Information and Knowledge Management, pp. 187–196. ACM, New York (2009)Google Scholar
  2. 2.
    Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. The Journal of Machine Learning Research 3, 993–1022 (2003)MATHGoogle Scholar
  3. 3.
    Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE, Los Alamitos (2008)Google Scholar
  4. 4.
    Bruckstein, A.M., Donoho, D.L., Elad, M.: From sparse solutions of systems of equations to sparse modeling of signals and images. SIAM review 51(1), 34–81 (2009)MathSciNetCrossRefMATHGoogle Scholar
  5. 5.
    Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, vol. 1, p. 22. Citeseer (2004)Google Scholar
  6. 6.
    Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. Journal of The American Society for Information Science 41(6), 391–407 (1990)CrossRefGoogle Scholar
  7. 7.
    Ding, T., Sznaier, M., Camps, O.I.: A rank minimization approach to video inpainting. In: IEEE International Conference on Computer Vision, pp. 1–8 (2007)Google Scholar
  8. 8.
    Ding, T., Sznaier, M., Camps, O.I.: Receding horizon rank minimization based estimation with applications to visual tracking. In: Proceedings of the 47th IEEE Conference on Decision and Control, CDC 2008, Cancún, México, December 9-11, pp. 3446–3451 (2008)Google Scholar
  9. 9.
    Elad, M.: Sparse and Redundant Representations: From Theory to Applications in Signal and Image Processing. Springer, Heidelberg (2010)CrossRefMATHGoogle Scholar
  10. 10.
    Fazel, M., Hindi, H., Boyd, S.P.: Log-det heuristic for matrix rank minimization with applications to hankel and euclidean distance matrices. In: Proceedings American Control Conference, pp. 2156–2162 (2003)Google Scholar
  11. 11.
    Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd annual international ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 50–57. ACM Press, New York (1999)Google Scholar
  12. 12.
    Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)CrossRefGoogle Scholar
  13. 13.
    Pearson, K.: On lines and planes of closest fit to systems of points in space. Philosophical Magazine 2(6), 559–572 (1901)CrossRefMATHGoogle Scholar
  14. 14.
    Schmid, C., Mohr, R.: Local grayvalue invariants for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 19(5), 530–535 (1997)CrossRefGoogle Scholar
  15. 15.
    Sivic, J., Zisserman, A.: Video google: Efficient visual search of videos. In: Ponce, J., Hebert, M., Schmid, C., Zisserman, A. (eds.) Toward Category-Level Object Recognition. LNCS, vol. 4170, pp. 127–144. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  16. 16.
    Weinberger, K.Q., Blitzer, J., Saul, L.K.: Distance metric learning for large margin nearest neighbor classification. In: NIPS. MIT Press, Cambridge (2006)Google Scholar
  17. 17.
    Weston, J., Bengio, S., Usunier, N.: Large scale image annotation: Learning to rank with joint word-image embeddings. Machine learning 81(1), 21–35 (2010)MathSciNetCrossRefGoogle Scholar
  18. 18.
    Yang, J., Yuan, X.: An Inexact Alternating Direction Method for Trace Norm Regularized Least Squares Problem. Report, Department of Mathematics, Nanjing Uinversity (2010)Google Scholar
  19. 19.
    Yuan, X.: Alternating Direction Methods for Sparse Covariance Selection. In: 20th International Symposium of Mathematical Programming, ISMP (2009)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Dmitriy Bespalov
    • 1
  • Anders Lindbjerg Dahl
    • 2
  • Bing Bai
    • 3
  • Ali Shokoufandeh
    • 1
  1. 1.Department of Computer ScienceDrexel UniversityUSA
  2. 2.DTU InformaticsTechnical University of DenmarkDenmark
  3. 3.NEC Labs AmericaUSA

Personalised recommendations