Abstract
Typically, in multimedia databases, there exist two kinds of clues for query: perceptive features and semantic classes. In this paper, we propose a novel framework for multimedia databases index and retrieval integrating the perceptive features and semantic classes to improve the speed and the precision of the content-based multimedia retrieval (CBMR). We develop a semantics supervised clustering based index approach (briefly as SSCI): the entire data set is divided hierarchically into many clusters until the objects within a cluster are not only close in the perceptive feature space but also within the same semantic class, and then an index term is built for each cluster. Especially, the perceptive feature vectors in a cluster are organized adjacently in disk. So the SSCI-based nearest-neighbor (NN) search can be divided into two phases: first, the indexes of all clusters are scanned sequentially to get the candidate clusters with the smallest distances from the query example; second, the original feature vectors within the candidate clusters are visited to get search results. Furthermore, if the results are not satisfied, the SSCI supports an effective relevance feedback (RF) search: users mark the positive and negative samples regarded a cluster as unit instead of a single object; then the Bayesian classifiers on perceptive features and that on semantics are used respectively to adjust retrieval similarity distance. Our experiments show that SSCI-based searching was faster than VA+-based searching; the quality of the search result based on SSCI was better than that of the sequential search in terms of semantics; and a few cycles of the RF by the proposed approach can improve the retrieval precision significantly.
Similar content being viewed by others
Notes
An index file in an information retrieval system is a file which stores keys that stand for objects in another file.
References
Bach JR, Fuller C, Gupta A et al (1996) Virage image search engine: an open framework for image management. In Proceedings of the SPIE Storage and Retrieval for Still Image and Video Databases, 76–87
Beckmann N, Kriegel HP, Schneider R, Seeger B (1990) The R*-tree: an efficient and robust access method for points and rectangles. In Proceeding of the ACM International Conference on Management of Data. SIGMOD, 322~331
Boujemaa N, Nastar C (1999) Content-based image retrieval at the IMEDIA group of the INRIA 10th DELOS Workshop Audio-Visual Digital Libraries Santorini, Greece, June
Chen Y, Wong EK (2001) A knowledge-based approach to video content classification. Proceedings of SPIE Vol. 4315: Storage and Retrieval for Media Databases, 292–300
Cox IJ, Miller ML, Minda TP et al (2000) The Bayesian image retrieval system, PicHunter: theory, implementation, and psychophysical experiments. IEEE Trans Image Process 9(1):20–37 doi:10.1109/83.817596
Ferhatosmanoglu H, Tuncel E, Agrawal D, El Abbadi A (2000) Vector approximation based indexing for non-uniform high dimensional data sets. In Proceedings of the 9th ACM Int. Conf. on Information and Knowledge Management, McLean, Virginia, 202–209
Ferhatosmanoglu H, Tuncel E, Agrawal D, El Abbadi A (2006) High dimensional nearest neighbor searching. Inf Syst J 31(6):512–540 doi:10.1016/j.is.2005.01.001
Fischer S, Lienhart R, Effelsberg W (1995) Automatic recognition of film genres. In Proc. of ACM Multimedia 95, San Francisco, CA, Nov, 295–304
Flickner M, Sawhney H, Niblack W et al (1995) Query by image and video content: the QBIC system. IEEE Comput 28(9):23–32
Ishikawa Y, Subramanya R, Faloustos C (1998) MindReader: query database through multiple examples. Proceedings of the 24th international conference on Very Large Data Bases. San Fransisco, 218–227
Kanth KVR, Agrawal D, Singh A (1998) Dimensionality reduction for similarity searching in dynamic databases. In Proc. ACM SIGMOD ICMD, 166–176
Liang S, Sun ZX (2006) BSVM-based relevance feedback for sketch retrieval. J Computer-Aided Des Comput Graph 18(11):1753–1757 in Chinese
Meilhac C, Nastar C (1999) Relevance feedback and category search in image databases. Proceedings of the IEEE International Conference on Multimedia Computing and System. Florence, Italy, 512–517
Mittal A, Cheong LF (2004) Addressing the problems of Bayesian network classification of video using high-dimensional features. IEEE Trans Knowl Data Eng 16(2):230–244 doi:10.1109/TKDE.2004.1269600
Naphade M, Smith JR, Tesic J, Chang SF, Hsu W, Kennedy L et al (2006) Large-scale concept ontology for multimedia. IEEE MultiMedia 13(3):86–91 doi:10.1109/MMUL.2006.63
Nievergelt J, Hinterberger H, Sevcik K (1984) The gridfile: an adaptable symmetric multikey file structure. ACM Trans Database Syst 9(1):38–71 doi:10.1145/348.318586
Rui Y, Huang TS (1998) Relevance feedback: a power tool for interactive content-based image retrieval. IEEE Transactions on Circuits and System for Video Technology, Special Issue on Segmentation, Description, and Retrieval of Video Content 8(5):644–655
Rui Y, Huang TS (2000) Optimizing learning in image retrieval. Proc. of IEEE Int. Conf. On Computer Vision and Pattern Recognition, Hilton Head, SC, 236–243
Rui Y, Huang TS, Mehrotra S (1997) Content-Based image retrieval with relevance feedback in MARS. Proceedings of IEEE International Conference on Image Processing. New York, 815–818
Shi ZP, Li QY, Shi ZZ, Duan CL (2005) Texture spectrum descriptor based image retrieval. J Softw 16(6):1039–1045 in Chinese doi:10.1360/jos161039
Smeulders A, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1379 doi:10.1109/34.895972
Smith JR, Chang SF (1997) Visually searching the web for content. IEEE Multimedia 4(3):12–20 doi:10.1109/93.621578
Su Z, Zhang HJ, Ma SP (2002) An image retrieval relevance feedback algorithm based on the Bayesian classifier. J Softw 13(10):2001–2006 in Chinese
Weber R, Bohm K (2000) Trading quality for time with nearest-neighbor search. In Proceedings of the 7th International Conference on Extending Database Technology, Konstanz, Germany, March, 21–35
Weber R, Schek H, Blott S (1998) A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In Proceeding of ACM VLDB’98, 194–205
Ye HJ, Xu GY (2003) Fast search in large-scale image database using vector quantization. In Proceeding of the International Conference on Image and Video Retrieval, Lecture Notes in Computer Science, vol. 2728, Springer, 458–467
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported by the National Science Foundation of China (No. 60435010), 863 National High-Tech Program (No.2006AA01Z128), National Basic Research Priorities Programme (No. 2007CB311004).
Rights and permissions
About this article
Cite this article
Shi, Z., He, Q. & Shi, Z. An index and retrieval framework integrating perceptive features and semantics for multimedia databases. Multimed Tools Appl 42, 207–231 (2009). https://doi.org/10.1007/s11042-008-0235-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-008-0235-y