Unsupervised Manifold Learning for Video Genre Retrieval

Almeida, Jurandy; Pedronette, Daniel C. G.; Penatti, Otávio A. B.

doi:10.1007/978-3-319-12568-8_74

Jurandy Almeida¹⁷,
Daniel C. G. Pedronette¹⁸ &
Otávio A. B. Penatti¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8827))

Included in the following conference series:

Iberoamerican Congress on Pattern Recognition

2252 Accesses
7 Citations

Abstract

This paper investigates the perspective of exploiting pairwise similarities to improve the performance of visual features for video genre retrieval. We employ manifold learning based on the reciprocal neighborhood and on the authority of ranked lists to improve the retrieval of videos considering their genre. A comparative analysis of different visual features is conducted and discussed. We experimentally show in the dataset of 14,838 videos from the MediaEval benchmark that we can achieve considerable improvements in results. In addition, we also evaluate how the late fusion of different visual features using the same manifold learning scheme can improve the retrieval results.

Download to read the full chapter text

Chapter PDF

Are All Combinations Equal? Combining Textual and Visual Features with Multiple Space Learning for Text-Based Video Retrieval

A cross-media distance metric learning framework based on multi-view correlation mining and matching

Article 21 April 2015

Finding Near-Duplicate Videos in Large-Scale Collections

Keywords

References

Almeida, J., Leite, N.J., Torres, R.S.: Comparison of video sequences with histograms of motion patterns. In: ICIP, pp. 3673–3676 (2011)
Google Scholar
Bimbo, A.: Visual information retrieval. Morgan Kaufmann Publishers Inc. (1999)
Google Scholar
Boureau, Y.L., Bach, F., LeCun, Y., Ponce, J.: Learning mid-level features for recognition. In: CVPR, pp. 2559–2566 (2010)
Google Scholar
Jiang, J., Wang, B., Tu, Z.: Unsupervised metric learning by self-smoothing operator. In: ICCV, pp. 794–801 (2011)
Google Scholar
Pedronette, D.C.G., Almeida, J., Torres, R.S.: A scalable re-ranking method for content-based image retrieval. Information Sciences 265, 91–104 (2014)
Article MathSciNet Google Scholar
Pedronette, D.C.G., Penatti, O.A.B., da Silva Torres, R.: Unsupervised manifold learning using reciprocal knn graphs in image re-ranking and rank aggregation tasks. Image and Vision Computing 32(2), 120–130 (2014)
Article Google Scholar
Pedronette, D.C.G., da Silva Torres, R.: Image re-ranking and rank aggregation based on similarity of ranked lists. Pattern Recognition 46(8), 2350–2360 (2013)
Article Google Scholar
Penatti, O.A.B., Li, L.T., Almeida, J., da Silva Torres, R.: A visual approach for video geocoding using bag-of-scenes. In: ICMR, pp. 1–8 (2012)
Google Scholar
Qin, D., Gammeter, S., Bossard, L., Quack, T., van Gool, L.: Hello neighbor: Accurate object retrieval with k-reciprocal nearest neighbors. In: CVPR, pp. 777–784 (2011)
Google Scholar
Schmiedeke, S., Kofler, C., Ferrané, I.: Overview of mediaeval 2012 genre tagging task. In: MediaEval (2012)
Google Scholar
van Gemert, J.C., Veenman, C.J., Smeulders, A.W.M., Geusebroek, J.M.: Visual word ambiguity. TPAMI 32(7), 1271–1283 (2010)
Google Scholar
Yang, X., Prasad, L., Latecki, L.: Affinity learning with diffusion on tensor product graph. TPAMI 35(1), 28–38 (2013)
Article Google Scholar
Yang, X., Latecki, L.J.: Affinity learning on a tensor product graph with applications to shape and image retrieval. In: CVPR, pp. 2369–2376 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Science and Technology, Federal University of São Paulo - UNIFESP, 12231-280, São José dos Campos, SP, Brazil
Jurandy Almeida
Dept. of Statistics, Applied Mathematics and Computation, São Paulo State University - UNESP, 13506-900, Rio Claro, SP, Brazil
Daniel C. G. Pedronette
Advanced Technologies, SAMSUNG Research Institute, 13097-160, Campinas, SP, Brazil
Otávio A. B. Penatti

Authors

Jurandy Almeida
View author publications
You can also search for this author in PubMed Google Scholar
Daniel C. G. Pedronette
View author publications
You can also search for this author in PubMed Google Scholar
Otávio A. B. Penatti
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electrical Engineering and Computer Science, CINVESTAV, Guadalajara, Jalisco, México
Eduardo Bayro-Corrochano
Department of Computer Science, University of York, YO10 5GH, Deramore Lane, York, UK
Edwin Hancock

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Almeida, J., Pedronette, D.C.G., Penatti, O.A.B. (2014). Unsupervised Manifold Learning for Video Genre Retrieval. In: Bayro-Corrochano, E., Hancock, E. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2014. Lecture Notes in Computer Science, vol 8827. Springer, Cham. https://doi.org/10.1007/978-3-319-12568-8_74

Download citation

DOI: https://doi.org/10.1007/978-3-319-12568-8_74
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12567-1
Online ISBN: 978-3-319-12568-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Unsupervised Manifold Learning for Video Genre Retrieval

Abstract

Chapter PDF

Similar content being viewed by others

Are All Combinations Equal? Combining Textual and Visual Features with Multiple Space Learning for Text-Based Video Retrieval

A cross-media distance metric learning framework based on multi-view correlation mining and matching

Finding Near-Duplicate Videos in Large-Scale Collections

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Unsupervised Manifold Learning for Video Genre Retrieval

Abstract

Chapter PDF

Similar content being viewed by others

Are All Combinations Equal? Combining Textual and Visual Features with Multiple Space Learning for Text-Based Video Retrieval

A cross-media distance metric learning framework based on multi-view correlation mining and matching

Finding Near-Duplicate Videos in Large-Scale Collections

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation