Abstract
Multimedia data particularly digital videos that contain various modalities (visual, audio, and text) are complex and time consuming to deal with. Therefore, managing a large volume of multimedia data reveals the necessity for efficient methods for modeling, processing, storing and retrieving such data. In this study, we investigate how to efficiently manage multimedia data, especially video data. In addition, we discuss various flexible query types including the combination of content as well as concept-based queries that provide users with the ability to perform multimodal query. Furthermore, we introduce a fusion-based approach at the query level to improve query retrieval performance of the multimedia database. Our experimental tests show a significant improvement in the query retrieval performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Yilmaz, T., Yildirim, Y., Yazici, A.: A genetic algorithms based classifier for object classification in images. In: ISCIS, London, pp. 519–525 (2011)
Demir, U., Koyuncu, M., Yazici, A., Yilmaz, T., Sert, M.: Flexible content extraction and querying for videos. In: Christiansen, H., De Tré, G., Yazici, A., Zadrozny, S., Andreasen, T., Larsen, H.L. (eds.) FQAS 2011. LNCS, vol. 7022, pp. 460–471. Springer, Heidelberg (2011)
Jonker, W., Petkovic, M.: An overview of data models and query languages for content-based video retrieval. In: Advances in Infrastructure for E-Business, Science, and Education on the Internet, Italy (2000)
Datta, R., Joshi, D., Li, J., Wang, J.: Image retrieval: Ideas, influences, and trends of the new age. In: ACM Computing Surveys (CSUR), pp. 1–60. ACM (2008)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. In: JMLR, vol. 3, pp. 993–1022 (2003)
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, vol. 1, p. 22 (2004)
Deerwester, S., Dumais, S., Furnas, G., Landauer, T., Harshman, R.: Indexing by latent semantic analysis. Journal of the American society for Information Science 41(6), 391–404 (1990)
Wei, X.Y., Jiang, Y.G., Ngo, H.: Concept-Driven Multi-Modality Fusion for Video Search. IEEE Transactions on Circuits and Systems for Video Technology 21(1), 62–73 (2011)
Natsev, A., Haubold, A., Tesic, J., Xie, L., Yan., R.: Semantic concept-based query expansion and re-ranking for multimedia retrieval. In: MULTIMEDIA 2007, pp. 991–1000. ACM, NY (2007)
Dunckley, L.: Multimedia Databases – An Object-Relational Approach (2003). ISBN # 0 201 78899 3
Bhatti, M.A., Rashid, U.: Exploration and management of web based multimedia information resources. In: International Conference on Systems, Computing Sciences and Software Engineering. Bridgeport, USA (2007)
Rolleke, T., Tsikrika, T., Kazai, G.: A general matrix framework for modeling information retrieval. In: Proceedings of the ACM SIGIR MF/IR (2003)
Sanchez, J.A., Arias, J.A.: Content-based search and annotations in multimedia digital libraries. In: ENC 2003, Fourth Mexican International Conference on Computer Science, pp. 109–117 (2003)
Kerne, A., Koh, E., Dworaczyk, B., Mistrot, J.M., Choi, H., Smith, S.M., Graeber, R., Caruso, D.: CombinFormation: a mixed-initiative system for representing collections as compositions of image and text. In: JCDL 2005, ACM/IEEECS Joint Conference on Digital Libraries, pp. 11–20 (2006)
Witten, I.H., Bainbridge, D.: Building digital library collections with greenstone. In: JCDL 2005, 5th ACM/IEEECS Joint Conference on Digital Libraries, pp. 425–425 (2005)
Wilson, M.L.: Advanced search interfaces considering information retrieval and human computer interaction. In: Agents and Multimedia, Southampton (2007)
Schraefel, M.C., Wilson, M., Russel, A., Smith, D.A.: MSPACE: improving information access to multimedia domains with multimodal exploratory search. Communication of the ACM 49(4), 47–49 (2006). ACM
Dunn, W.: EVIADA: ethnomusicological video for instruction and analysis digital archive. In: JCDL 2005, 5th ACM/IEEE-CS Joint Conference on Digital Libraries, p. 407 (2005)
Rashid, U., Niaz, I.A., Bhatti, M.A.: Unified multimodal search framework for multimedia information retrieval. In: 4th International Conference on Systems, Computing Sciences and Software Engineering. Springer, Bridgeport (2007)
Yan, R.: Probabilistic Models for Combining Diverse Knowledge Sources in Multimedia Retrieval. Ph.D. Thesis. School of Computer Science, Carnegie Mellon University, USA (2006)
Manmatha, R.: Multimedia indexing and retrieval. In: Workshop on Challenges in Information Retrieval and Language Modeling (2002)
Westerveld, T., Ianeva, T., Boldareva, L., de Vries A.P., Hiemstra, D.: Combining infomation sources for video retrieval. In: NIST TRECVID (2003)
Amir, A., Hsu, W., Iyengar, G., Lin, C.Y., Naphade, M., Natsev, A., Neti, C., Nock, H.J., Smith, J. R., Tseng, B.L., Wu, Y., Zhang, D.: IBM research TRECVID- video retrieval system. In: NIST TRECVID (2003)
Rautiainen, M., Hosio, M., Hanski, I., Varanka, M., Kortelainen, J., Ojala, T., Seppanen, T.: TRECVID 2004 experiments at mediateam oulu. In: Proc. of TRECVID (2004)
Yu, J., Cong, Y., Qin, Z., Wan, T.: Cross-modal topic correlations for multimedia retrieval. In: ICPR, Tsukuba, Japan (2012)
Song, Y., Philippe Morency, L., Davis, R.: Multimodal human behavior analysis: learning correlation and interaction across modalities. In: ICMI, Santa Monica, California, USA, pp. 22–26 (2012)
Jiang, W., Loui, A.C.: Video concept detection by audio-visual grouplets. Multimedia Information Retrieval 1, 223–238 (2012)
Zeng, J.D., Zheng, H.J., Lu, C., Li, T., Ma, W.: ReCoM: reinforcement clustering of multi-type interrelated data objects. In: ACM SIGIR, pp. 274–281. ACM, Canada (2003)
Wang, X.J., Ma, W.Y., Xue, G.R., Xing, L.: Multi-model similarity propagation and its application for web image retrieval. In: ACM Multimedia, pp. 944–951. ACM (2004)
Zhang, H., Zhuang, Y.T., Wu, F.: Cross-modal correlation learning for clustering on image-audio dataset. In: ACM Multimedia, pp. 273–276. ACM (2007)
Yang, Y., Zuang, Y.T., Wu, F., Pan, Y.H.: Harmonizing hierarchical manifolds for multimedia document semantics understanding and cross-media retrieval. IEEE Transactions, 437–446. (2008)
Rasiwasia, N., Pereira, J.C., Coviello, E., Doyle, G., Lanckriet, Gert, R.G., Levy, R., Vasconcelos, N.: A new approach to cross-modal multimedia retrieval. In: Multimedia 2010. ACM, Firenze (2010)
Chuang, C.T., Yang, K.H., Lin, Y.L., Wang, J.H.: Combining query terms extension and weight correlative for expert finding. In: International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT). IEEE (2014)
Safadi, B., Sahuguet, M., Benoi, H.: When textual and visual information join forces for multimedia retrieval. In: ICMR 2014. ACM, Glasgow (2014)
Aydinlilar, M., Yazici, A.: Semi-automatic semantic video annotation tool. In: ISCIS 2012, Paris, pp. 303–310 (2012)
Okuyucu, C., Sert, M., Yazici, A.: Environmental sound classification using spectral. In: IEEE, Turkey (2013)
Kucuk, D., Yazici, A.: Exploiting information extraction techniques for automatic semantic video indexing with an application to Turkish news videos. Knowledge-Based Systems 25(6), 844–857 (2011)
Arslan, S., Yazici, A., Sacan, A., Toroslu, I.H., Acar, E.: Comparison of feature-based and image registration-based retrieval of image data using multidimensional data access methods. Data & Knowledge Engineering 86, 124–145 (2013). Elsevier
Hardoon, D.R., Szedmak, S., Taylor, J.S.: Canonical correlation analysis; An overview with application to learning methods. Technical Report CSD-TR-03-02. University of London (2003)
Kuss, M., Graepel, T.: The Geometry Of Kernel Canonical Correlation Analysis. Technical Report No. 108. Max Planck Institute (2003)
News Channel of Turkey. http://www.ntvmsnbc.com/
Naphade, M., Smith, J.R., Tesic, J., Chang, S.-F., Hsu, W., Kennedy, L., Hauptmann, A., Curtis, J.: Large-scale concept ontology for multimedia. IEEE MultiMedia 13(3), 86–91 (2006). IEEE
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Sattari, S., Yazici, A. (2016). Efficient Multimedia Information Retrieval with Query Level Fusion. In: Andreasen, T., et al. Flexible Query Answering Systems 2015. Advances in Intelligent Systems and Computing, vol 400. Springer, Cham. https://doi.org/10.1007/978-3-319-26154-6_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-26154-6_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26153-9
Online ISBN: 978-3-319-26154-6
eBook Packages: Computer ScienceComputer Science (R0)