Abstract
We propose a novel video integration architecture, INTERVIDEO, for faceted search on web-scale. First, we demonstrate that the traditional video integration techniques are no longer valid in face of such heterogeneity and scale. Then, we present three new integrating techniques to build a global relation schema for organizing web videos and aiding user to retrieve faceted results. Finally, we conduct an experimental study and demonstrate the ability of our system to automatically integrate videos and build a complete and concise high-level relation schema on large, heterogeneous web sites.
Chapter PDF
Similar content being viewed by others
References
Yee, K.P., Swearingen, K., Li, K., Hearst, M.: Faceted metadata for image search and browsing. In: Proc. of the SIGCHI Conference on Human Factors in Computing Systems (2003)
Teevan, J., Dumais, S.T., Gutt, Z.: Challenges for Supporting Faceted Search in Large, Heterogeneous Corpora like the Web. In: Proceedings of HCIR (2008)
Barish, G., Shin Chen, Y., Dipasquo, D., Knoblock, C.A., et al.: Theaterloc: Using information integration technology to rapidly build virtual applications. In: ICDE (2000)
Bleiholder, J., Naumann, F.: Data fusion. ACM Computing Surveys 41(1) ( December 2008)
Cao, J., Zhang, Y.D., et al.: VideoMap: An Interactive Video Retrieval System of MCG-ICT-CAS. In: CIVR 2009 (July 2009)
Christel, M.G., Yan, R.: Merging Storyboard Strategies and Automatic Retrieval for Improving Interactive Video Search. In: CIVR 2007 (July 2007)
Chang, K., He, B., Zhang, Z.: Toward large scale integration: Building a MetaQuerier over database on the web. In: CIDR (2005)
Madhavan, J., Jeffery, S.R., Cohen, S., et al.: Web-scale data integration: you can only afford to pay as you go. In: CIDR (2007)
Amer-Yahia, S., Lakshmanan, L., Yu, C.: SocialScope: Enabling Information Discovery on Social Content Sites [C]. In: CIDR (2009)
Crescenzi, V., Mecca, G., Merialdo, P.: RoadRunner: Towards automatic data extraction from large web wites. In: VLDB (2001)
Arasu, A., Molina, H.G.: Extracting structured data from Web pages. In: SIGMOD (2003)
Zhai, Y., Liu, B.: Web data extraction based on partial tree alignment. In: WWW (2005)
Hung, M., Zou, Y.: Recovering workflows from multi tiered e-commerce systems. In: 15th IEEE International Conference on Program Comprehension, ICPC 2007 (2007)
Wu, X., Hauptmann, A.G., Ngo, C.-W.: Practical elimination of near-duplicates from web video search. In: ACM Multimedia, MM 2007 (2007)
Siersdorfer, S., Pedro, J.S., Sanderson, M.: Automatic video tagging using content redundancy. In: SIGIR 2009, July 19-23 (2009)
Pedro, J.S., Dominguez, S.: Network-aware identification of video clip fragments. In: CIVR 2007, pp. 317–324. ACM Press, New York (2007)
Abbasi, R., Staab, S.: RichVSM: enRiched vector space models for folksonomies. In: Proceedings of the 20th ACM Conference on Hypertext and Hypermedia (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 IFIP International Federation for Information Processing
About this paper
Cite this paper
Liao, Z., Yang, J., Fu, C., Zhang, G. (2010). Integrating Web Videos for Faceted Search Based on Duplicates, Contexts and Rules. In: Shi, Z., Vadera, S., Aamodt, A., Leake, D. (eds) Intelligent Information Processing V. IIP 2010. IFIP Advances in Information and Communication Technology, vol 340. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16327-2_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-16327-2_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16326-5
Online ISBN: 978-3-642-16327-2
eBook Packages: Computer ScienceComputer Science (R0)