Abstract
High-dimensional index is one of the most challenging tasks for content-based video retrieval (CBVR). Typically, in video database, there exist two kinds of clues for query: visual features and semantic classes. In this paper, we modeled the relationship between semantic classes and visual feature distributions of data set with the Gaussian mixture model (GMM), and proposed a semantics supervised cluster based index approach (briefly as SSCI) to integrate the advantages of both semantic classes and visual features. The entire data set is divided hierarchically by a modified clustering technique into many clusters until the objects within a cluster are not only close in the visual feature space but also within the same semantic class, and then an index entry including semantic clue and visual feature clue is built for each cluster. Especially, the visual feature vectors in a cluster are organized adjacently in disk. So the SSCI-based nearest-neighbor (NN) search can be divided into two phases: the first phase computes the distances between the query example and each cluster index and returns the clusters with the smallest distance, here namely candidate clusters; then the second phase retrieves the original feature vectors within the candidate clusters to gain the approximate nearest neighbors. Our experiments showed that for approximate searching the SSCI-based approach was faster than VA + -based approach; moreover, the quality of the result set was better than that of the sequential search in terms of semantics.
This paper is supported by National Natural Science Foundation of China No. 60435010 and National Basic Research Priorities Programme No. 2003CB317004.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Kanth, K.V.R., Agrawal, D., Singh, A.: Dimensionality reduction for similarity searching in dynamic databases. In: Proc. ACM SIGMOD ICMD, pp. 166–176 (1998)
Weber, R., Schek, H., Blott, S.: A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces. In: Proceeding of ACM VLDB (1998)
Ferhatosmanoglu, H., Tuncel, E., Agrawal, D., El Abbadi, A.: Vector approximation based indexing for non-uniform high dimensional data sets. In: Proceedings of the 9th ACM Int. Conf. on Information and Knowledge Management, McLean, Virginia, pp. 202–209 (2000)
Ye, H.J., Xu, G.Y.: Fast search in large-scale image database using vector quantization. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 458–467. Springer, Heidelberg (2003)
Ferhatosmanoglu, H., Tuncel, E., Agrawal, D., El Abbadi, A.: Approximate Nearest Neighbor Searching in Multimedia Databases. In: Proceedings of the 17th Int. Conf. on Data Engineering, Wanshington, DC, USA, pp. 503–511 (2001)
Fischer, S., Lienhart, R., Effelsberg, W.: Automatic Recognition of Film Genres. In: ACM Multimedia 1995, San Francisco, USA, pp. 295–304 (1995)
Chen, Y., Wong, E.K.: A Knowledge-Based Approach to Video Content Classification. In: Proceedings of SPIE: Storage and Retrieval for Media Databases, vol. 4315, pp. 292–300 (2001)
Mittal, A., Cheong, L.F.: Addressing the problems of Bayesian Network Classification of Video Using High-Dimensional Features. IEEE Trans. On Knowledge and Data Engineering 16(2), 230–244 (2004)
Shi, Z.P., Hu, H., Li, Q.Y., Shi, Z.Z., Duan, C.L.: Texture spectrum descriptor based image retrieval. Journal of Software (Chineses) 16(6), 1039–1045 (2005)
Weber, R., Böhm, K.: Trading quality for time with nearest-neighbor search. In: Zaniolo, C., Grust, T., Scholl, M.H., Lockemann, P.C. (eds.) EDBT 2000. LNCS, vol. 1777, pp. 21–35. Springer, Heidelberg (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shi, Z., Li, Q., Shi, Z., Shi, Z. (2006). Semantics Supervised Cluster-Based Index for Video Databases. In: Sundaram, H., Naphade, M., Smith, J.R., Rui, Y. (eds) Image and Video Retrieval. CIVR 2006. Lecture Notes in Computer Science, vol 4071. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11788034_46
Download citation
DOI: https://doi.org/10.1007/11788034_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36018-6
Online ISBN: 978-3-540-36019-3
eBook Packages: Computer ScienceComputer Science (R0)