Skip to main content

Semantics Supervised Cluster-Based Index for Video Databases

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4071))

Abstract

High-dimensional index is one of the most challenging tasks for content-based video retrieval (CBVR). Typically, in video database, there exist two kinds of clues for query: visual features and semantic classes. In this paper, we modeled the relationship between semantic classes and visual feature distributions of data set with the Gaussian mixture model (GMM), and proposed a semantics supervised cluster based index approach (briefly as SSCI) to integrate the advantages of both semantic classes and visual features. The entire data set is divided hierarchically by a modified clustering technique into many clusters until the objects within a cluster are not only close in the visual feature space but also within the same semantic class, and then an index entry including semantic clue and visual feature clue is built for each cluster. Especially, the visual feature vectors in a cluster are organized adjacently in disk. So the SSCI-based nearest-neighbor (NN) search can be divided into two phases: the first phase computes the distances between the query example and each cluster index and returns the clusters with the smallest distance, here namely candidate clusters; then the second phase retrieves the original feature vectors within the candidate clusters to gain the approximate nearest neighbors. Our experiments showed that for approximate searching the SSCI-based approach was faster than VA + -based approach; moreover, the quality of the result set was better than that of the sequential search in terms of semantics.

This paper is supported by National Natural Science Foundation of China No. 60435010 and National Basic Research Priorities Programme No. 2003CB317004.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kanth, K.V.R., Agrawal, D., Singh, A.: Dimensionality reduction for similarity searching in dynamic databases. In: Proc. ACM SIGMOD ICMD, pp. 166–176 (1998)

    Google Scholar 

  2. Weber, R., Schek, H., Blott, S.: A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces. In: Proceeding of ACM VLDB (1998)

    Google Scholar 

  3. Ferhatosmanoglu, H., Tuncel, E., Agrawal, D., El Abbadi, A.: Vector approximation based indexing for non-uniform high dimensional data sets. In: Proceedings of the 9th ACM Int. Conf. on Information and Knowledge Management, McLean, Virginia, pp. 202–209 (2000)

    Google Scholar 

  4. Ye, H.J., Xu, G.Y.: Fast search in large-scale image database using vector quantization. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 458–467. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  5. Ferhatosmanoglu, H., Tuncel, E., Agrawal, D., El Abbadi, A.: Approximate Nearest Neighbor Searching in Multimedia Databases. In: Proceedings of the 17th Int. Conf. on Data Engineering, Wanshington, DC, USA, pp. 503–511 (2001)

    Google Scholar 

  6. Fischer, S., Lienhart, R., Effelsberg, W.: Automatic Recognition of Film Genres. In: ACM Multimedia 1995, San Francisco, USA, pp. 295–304 (1995)

    Google Scholar 

  7. Chen, Y., Wong, E.K.: A Knowledge-Based Approach to Video Content Classification. In: Proceedings of SPIE: Storage and Retrieval for Media Databases, vol. 4315, pp. 292–300 (2001)

    Google Scholar 

  8. Mittal, A., Cheong, L.F.: Addressing the problems of Bayesian Network Classification of Video Using High-Dimensional Features. IEEE Trans. On Knowledge and Data Engineering 16(2), 230–244 (2004)

    Article  Google Scholar 

  9. Shi, Z.P., Hu, H., Li, Q.Y., Shi, Z.Z., Duan, C.L.: Texture spectrum descriptor based image retrieval. Journal of Software (Chineses) 16(6), 1039–1045 (2005)

    Article  MATH  Google Scholar 

  10. Weber, R., Böhm, K.: Trading quality for time with nearest-neighbor search. In: Zaniolo, C., Grust, T., Scholl, M.H., Lockemann, P.C. (eds.) EDBT 2000. LNCS, vol. 1777, pp. 21–35. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shi, Z., Li, Q., Shi, Z., Shi, Z. (2006). Semantics Supervised Cluster-Based Index for Video Databases. In: Sundaram, H., Naphade, M., Smith, J.R., Rui, Y. (eds) Image and Video Retrieval. CIVR 2006. Lecture Notes in Computer Science, vol 4071. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11788034_46

Download citation

  • DOI: https://doi.org/10.1007/11788034_46

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-36018-6

  • Online ISBN: 978-3-540-36019-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics