Abstract
Managing a large volume of multimedia data, which contain various modalities (visual, audio, and text), reveals the need for a specialized multimedia database system (MMDS) to efficiently model, process, store and retrieve video shots based on their semantic content. This demo introduces METU-MMDS, an intelligent MMDS which employs both machine learning and database techniques. The system extracts semantic content automatically by using visual, audio and textual data, stores the extracted content in an appropriate format and uses this content to efficiently retrieve video shots. The system architecture supports various multimedia query types including unimodal querying, multimodal querying, query-by-concept, query-by-example, and utilizes a multimedia index structure for efficiently querying multi-dimensional multimedia data. We demonstrate METU-MMDS for semantic data extraction from videos and complex multimedia querying by considering content and concept-based queries containing all modalities.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rashid, U., Bhatti, M.A.: Exploration and management of web based multimedia information resources. In: Elleithy, K. (ed.) Innovations and Advanced Techniques in Systems, Computing Sciences and Software Engineering, pp. 500–506. Springer, The Netherlands (2008)
Brendan, J., Hongzhi, L., et al.: Structured exploration of who, what, when, and where in heterogeneous multimedia news sources. In: ACM MM, pp. 357–360 (2013)
Stefanidis, K., Koutrika, G., Pitoura, E.: A survey on representation, composition and application of preferences in database systems. J. TODS 36, 19–45 (2011). ACM
Meng, T., Shyu, M.L.: Leveraging concept association network for multimedia rare concept mining and retrieval. In: ICME, pp. 860–865. IEEE, Melbourne (2012)
Smith, J.R.: Riding the multimedia big data wave. In: SIGIR, pp. 1–2. ACM (2013)
Aydinlilar, M., Yazici, A.: Semi-automatic semantic video annotation tool. In: Gelenbe, E., Lent, R. (eds.) International Symposium on Computer and Information Sciences, pp. 303–310. Springer, Paris (2012)
Yilmaz, T., Yazici, A., Yildirim, Y.: Exploiting class-specific features in multi-feature dissimilarity space for efficient querying of images. In: Christiansen, H., De Tré, G., Yazici, A., Zadrozny, S., Andreasen, T., Larsen, H.L. (eds.) FQAS 2011. LNCS, vol. 7022, pp. 149–161. Springer, Heidelberg (2011)
Deng, Y., Manjunath, B.S.: Unsupervised segmentation of color-texture regions in images and video. IEEE J. TPAMI 23(8), 800–810 (2001)
Okuyucu, C., Sert, M., Yazici, A.: Audio feature and classifier analysis for efficient recognition of environmental sounds. In: ISM, pp. 125–132. IEEE, USA (2013)
Kucuk, D., Yazici, A.: Exploiting information extraction techniques for automatic semantic video indexing with an application to Turkish news videos. J. Knowl.-Based Sys. 25(6), 844–857 (2011)
Gulen, E., Yilmaz, T., Yazici, A.: Multimodal information fusion for semantic video analysis. J. IJMDEM 3(4), 52–74 (2012)
Kucuk, D., Ozgur, N.B., Yazici, A., Koyuncu, M.: A fuzzy conceptual model for multimedia data with a text-based automatic annotation scheme. J. IJUFKS 17(1), 135–152 (2009)
Yazici, A., Ince, C., Koyuncu, M.: Food index: a multidimensional index structure for similarity-based fuzzy object-oriented database models. J. IEEE Trans. Fuzzy Sys. 16(4), 942–957 (2008). IEEE
Arslan, S., Yazici, A., Sacan, A., Toroslu, I.H., Acar, E.: Comparison of feature-based and image registration-based retrieval of image data using multidimensional data access methods. J. TKDE 86, 124–145 (2013). Elsevier
Safadi, B., Sahuguet, M., Huet, B.: When textual and visual information join forces for multimedia retrieval. In: ICMR, pp. 265–272. ACM (2014)
Yu, J., Cong, Y., Qin, Z., Wan, T.: Cross-modal topic correlations for multimedia retrieval. In: International Conference on Pattern Recognition, pp. 246–249. IEEE, Japan (2012)
Acknowledgments
This work is supported by the research grant from TUBITAK with the grant number 114R0182. We also thank to all of the previous researchers of Multimedia Db. Lab. at METU who have contributed to this study.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Yazici, A., Sattari, S., Yilmaz, T., Sert, M., Koyuncu, M., Gulen, E. (2016). METU-MMDS: An Intelligent Multimedia Database System for Multimodal Content Extraction and Querying. In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science(), vol 9517. Springer, Cham. https://doi.org/10.1007/978-3-319-27674-8_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-27674-8_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27673-1
Online ISBN: 978-3-319-27674-8
eBook Packages: Computer ScienceComputer Science (R0)