Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Indexing Techniques for Multimedia Data Retrieval

  • Jingkuan Song
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_80631

Synonyms

Hashing; Indexing; Multimedia data; Quantization; Retrieval; Tree

Definition

Indexing techniques for multimedia data retrieval is defined as the problem of preprocessing a database of multimedia objects to provide efficient accesses and comparisons on the basis of their extracted features. Due to the very nature of multimedia content which is represented by high-dimensional float-valued feature vectors, the complexity of similarity criteria that are used to compare multimedia objects is often high. The goal of multimedia indexing is to effectively support multimedia similarity search which serves as the foundation of most multimedia applications. This can be realized by accessing a very small portion of database objects and/or approximating expensive similarity computations with efficient forms. Most multimedia applications actually do not require exact similarity search. To improve efficiency, approximate similarity search is often used, given satisfactory search accuracy...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Beis JS, Lowe DG. Shape indexing using approximate nearest-neighbour search in high-dimensional spaces. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition; 1997. p. 1000–6.Google Scholar
  2. 2.
    Bentley JL. Multidimensional binary search trees used for associative searching. Commun ACM. 1975;18(9):509–17.zbMATHCrossRefGoogle Scholar
  3. 3.
    Böhm C, Berchtold S, Keim DA. Searching in high-dimensional spaces: index structures for improving the performance of multimedia databases. ACM Comput Surv. 2001;33(3):322–73.CrossRefGoogle Scholar
  4. 4.
    Chakrabarti K, Mehrotra S. Local dimensionality reduction: a new approach to indexing high dimensional spaces. In: Proceedings of the 26th International Conference on Very Large Data Bases; 2000. p. 89–100.Google Scholar
  5. 5.
    Datar M, Immorlica N, Indyk P, Mirrokni VS. Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the 20th Annual Symposium on Computational Geometry; 2004. p. 253–62.Google Scholar
  6. 6.
    Friedman JH, Bentley JL, Finkel RA. An algorithm for finding best matches in logarithmic expected time. ACM Trans Math Softw. 1977;3(3): 209–26.zbMATHCrossRefGoogle Scholar
  7. 7.
    Gao L, Song J, Nie F, Yan Y, Sebe N, Shen HT. Optimal graph leaning with partial tags and multiple features for image and video annotation. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition; 2015.Google Scholar
  8. 8.
    Gao L, Song J, Zou F, Zhang D, Shao J. Scalable multimedia retrieval by deep learning hashing with relative similarity learning. In: Proceedings of the 23rd ACM International Conference on Multimedia; 2015.Google Scholar
  9. 9.
    Guttman A. R-trees: a dynamic index structure for spatial searching. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1984. p. 47–57.Google Scholar
  10. 10.
    Huang Z, Shen HT, Shao J, Rüger SM, Zhou X. Locality condensation: a new dimensionality reduction method for image retrieval. In: Proceedings of the 16th ACM International Conference on Multimedia; 2008. p. 219–28.Google Scholar
  11. 11.
    Huang Z, Wang L, Shen HT, Shao J, Zhou X. Online near-duplicate video clip detection and retrieval: an accurate and fast system. In: Proceedings of the 25th International Conference on Data Engineering; 2009. p. 1511–4.Google Scholar
  12. 12.
    Jagadish HV, Ooi BC, Tan K-L, Yu C, Zhang R. Idistance: an adaptive b+-tree based indexing method for nearest neighbor search. ACM Trans Database Syst. 2005;30(2):364–97.CrossRefGoogle Scholar
  13. 13.
    Norouzi M, Fleet DJ. Cartesian k-means. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition; 2013.Google Scholar
  14. 14.
    Song J, Gao L, Yan Y, Zhang D, Sebe N. Supervised hashing with pseudo labels for scalable multimedia retrieval. In: Proceedings of the 23rd ACM International Conference on Multimedia; 2015.Google Scholar
  15. 15.
    Song J, Yang Y, Huang Z, Shen HT, Hong R. Multiple feature hashing for real-time large scale near-duplicate video retrieval. In: Proceedings of the 19th ACM International Conference on Multimedia; 2011. p. 423–32.Google Scholar
  16. 16.
    Song J, Yang Y, Yang Y, Huang Z, Shen HT. Inter-media hashing for large-scale retrieval from heterogeneous data sources. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2013. p. 785–96.Google Scholar
  17. 17.
    Wang J, Wang J, Song J, Xu X-S, Shen HT, Li S. Optimized cartesian k-means. IEEE Trans Knowl Data Eng. 2015;27(1):180–92.CrossRefGoogle Scholar
  18. 18.
    Wang J, Zhang T, Song J, Sebe N, Shen HT. A survey on learning to hash. IEEE Trans Pattern Anal Mach Intell. 2017;(99):1Google Scholar
  19. 19.
    Zhang S, Fan J, Lu H, Xue X. Salient object detection on large-scale video data. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition; 2007.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Columbia UniversityNew YorkUSA