Heterogeneous Metric Learning for Cross-Modal Multimedia Retrieval

  • Conference paper
Web Information Systems Engineering – WISE 2013 (WISE 2013)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8180)

Abstract

Due to the massive explosion of multimedia content on the web, users demand a new type of information retrieval, called cross-modal multimedia retrieval, in which users submit queries of one media type and get results of various other media types. Performing effective retrieval of heterogeneous multimedia content brings new challenges. One essential aspect of these challenges is learning a heterogeneous metric between different types of multimedia objects. In this paper, we propose a Bayesian personalized ranking based heterogeneous metric learning (BPRHML) algorithm, which optimizes for correctly ranking the retrieval results. It uses pairwise preference constraints as training data and explicitly optimizes for preserving these constraints. To further encourage the smoothness of the learning results, we integrate graph regularization with Bayesian personalized ranking. The experimental results on two publicly available datasets show the effectiveness of our method.
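
The abstract does not give the BPRHML objective, so the following is only a minimal sketch of the two ingredients it names, under stated assumptions. It assumes a bilinear cross-modal similarity s(q, y) = qᵀWy, the standard BPR pairwise log-sigmoid criterion with an L2 penalty, and a Laplacian smoothness term as one common form of graph regularization. The names `bpr_step`, `graph_penalty`, `lr`, and `reg` are hypothetical and not taken from the paper.

```python
import numpy as np


def sigmoid(z):
    """Logistic function."""
    return 1.0 / (1.0 + np.exp(-z))


def bpr_step(W, q, y_pos, y_neg, lr=0.1, reg=0.01):
    """One stochastic gradient ascent step on a BPR-style criterion.

    Assumed objective for a single preference (q prefers y_pos over y_neg):
        ln sigmoid(q^T W y_pos - q^T W y_neg) - (reg / 2) * ||W||_F^2
    W maps between the query modality and the result modality.
    """
    diff = y_pos - y_neg
    margin = q @ W @ diff
    # d/dW ln sigmoid(margin) = (1 - sigmoid(margin)) * outer(q, diff)
    grad = (1.0 - sigmoid(margin)) * np.outer(q, diff) - reg * W
    return W + lr * grad


def graph_penalty(Z, A):
    """Laplacian smoothness penalty tr(Z^T L Z) with L = D - A.

    Z holds one embedded object per row; A is a symmetric affinity matrix.
    The value equals 0.5 * sum_ij A[i, j] * ||Z[i] - Z[j]||^2, so it is
    small when strongly connected objects have nearby embeddings.
    """
    L = np.diag(A.sum(axis=1)) - A
    return np.trace(Z.T @ L @ Z)


# Toy usage: a 4-dimensional query (e.g. text) and 3-dimensional results (e.g. images).
rng = np.random.default_rng(0)
q = rng.normal(size=4)
y_pos, y_neg = rng.normal(size=3), rng.normal(size=3)
W = np.zeros((4, 3))
for _ in range(50):
    W = bpr_step(W, q, y_pos, y_neg)
print("preference margin:", q @ W @ (y_pos - y_neg))
```

In a full method the smoothness penalty would be added to the ranking objective with its own weight and differentiated along with it; here it is kept as a separate function to show what the regularizer measures.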

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Deng, J., Du, L., Shen, YD. (2013). Heterogeneous Metric Learning for Cross-Modal Multimedia Retrieval. In: Lin, X., Manolopoulos, Y., Srivastava, D., Huang, G. (eds) Web Information Systems Engineering – WISE 2013. WISE 2013. Lecture Notes in Computer Science, vol 8180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41230-1_4

  • DOI: https://doi.org/10.1007/978-3-642-41230-1_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41229-5

  • Online ISBN: 978-3-642-41230-1

  • eBook Packages: Computer Science, Computer Science (R0)
