Abstract
In big data epoch, one of the major challenges is the large volume of mixed structured and unstructured data. Because of different form, structured and unstructured data are often considered apart from each other. However, they may speak about the same entities of the world. If a query involves both structured data and its unstructured counterpart, it is inefficient to execute it separately. The paper presents a novel index structure tailored towards associations between structured and unstructured data, based on entity co-occurrences. It is also a semantic index represented as RDF graphs which describes the semantic relationships among entities. Experiments show that the associated index can not only provide apposite information but also execute queries efficiently.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Mudunuri, U.S., Khouja, M., Repetski, S.: Knowledge and Theme Discovery across Very Large Biological Data Sets Using Distributed Queries: A Prototype Combining Unstructured and Structured Data. PLoS One 8(12), e80503 (2013)
Wu, S., Jiang, D.W., Ooi, B.C., Wu, K.: Efficient B-tree based indexing for cloud data processing. Proc. of the VLDB Endowment 3(1), 1207–1218 (2010)
George, T., Iraklis, V., Kjetil, N.: SemaFor: semantic document indexing using semantic forests. In: CIKM, pp. 1692–1696 (2012)
Markus, G., Andreas, R., Helmut, B.: Bridging structured and unstructured data via hybrid semantic search and interactive ontology-enhanced query formulation. Knowl. Inf. Syst. 41(3), 761–792 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Zhu, C., Li, Q., Kong, L., Wang, X., Hong, X. (2015). Associated Index for Big Structured and Unstructured Data. In: Dong, X., Yu, X., Li, J., Sun, Y. (eds) Web-Age Information Management. WAIM 2015. Lecture Notes in Computer Science(), vol 9098. Springer, Cham. https://doi.org/10.1007/978-3-319-21042-1_64
Download citation
DOI: https://doi.org/10.1007/978-3-319-21042-1_64
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21041-4
Online ISBN: 978-3-319-21042-1
eBook Packages: Computer ScienceComputer Science (R0)