Abstract
This paper investigates how pairwise similarities can be exploited to improve the performance of visual features for video genre retrieval. We employ manifold learning based on reciprocal neighborhoods and on the authority of ranked lists to improve the retrieval of videos by genre. A comparative analysis of different visual features is conducted and discussed. Experiments on a dataset of 14,838 videos from the MediaEval benchmark show that considerable improvements in results can be achieved. In addition, we evaluate how late fusion of different visual features using the same manifold learning scheme can improve the retrieval results.
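As a rough illustration of the reciprocal-neighborhood idea mentioned in the abstract, the sketch below promotes items that appear in a query's top-k list and also have the query in their own top-k list. It is a simplified toy example, not the method evaluated in the paper: the function names (`top_k`, `reciprocal_neighbors`, `rerank`), the distance matrix, and the promotion rule are all assumptions made for illustration, and the authority of ranked lists and late fusion are not modeled.

```python
# Toy pairwise distance matrix for 5 videos (hypothetical values).
dist = [
    [0.0, 0.2, 0.3, 0.8, 0.9],
    [0.2, 0.0, 0.5, 0.1, 0.15],
    [0.3, 0.5, 0.0, 0.6, 0.7],
    [0.8, 0.1, 0.6, 0.0, 0.4],
    [0.9, 0.15, 0.7, 0.4, 0.0],
]


def top_k(dist, q, k):
    """Return the k items closest to q (q itself included), closest first."""
    order = sorted(range(len(dist)), key=lambda j: (dist[q][j], j))
    return order[:k]


def reciprocal_neighbors(dist, q, k):
    """Items in q's top-k list whose own top-k list also contains q."""
    return [j for j in top_k(dist, q, k) if q in top_k(dist, j, k)]


def rerank(dist, q, k):
    """Rank reciprocal neighbors of q first, then everything else.

    Within each group the original distance order is preserved.
    This promotion rule is an assumption chosen for simplicity.
    """
    recip = set(reciprocal_neighbors(dist, q, k))
    return sorted(range(len(dist)), key=lambda j: (j not in recip, dist[q][j], j))


if __name__ == "__main__":
    print("initial ranking:", top_k(dist, 0, len(dist)))
    print("reciprocal neighbors:", reciprocal_neighbors(dist, 0, 3))
    print("re-ranked:", rerank(dist, 0, 3))
```

In this toy data, video 1 is the closest item to video 0, but video 0 is not among video 1's own three nearest items, so the reciprocal check moves video 2 ahead of it.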
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Almeida, J., Pedronette, D.C.G., Penatti, O.A.B. (2014). Unsupervised Manifold Learning for Video Genre Retrieval. In: Bayro-Corrochano, E., Hancock, E. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2014. Lecture Notes in Computer Science, vol 8827. Springer, Cham. https://doi.org/10.1007/978-3-319-12568-8_74
DOI: https://doi.org/10.1007/978-3-319-12568-8_74
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12567-1
Online ISBN: 978-3-319-12568-8
eBook Packages: Computer Science (R0)