Abstract
Finding efficient and useful ways to search and index XML documents is an active research topic in computer and information science. Path-based indexing methods suffer from performance degradation when they perform join operations on ancestor-descendant relationships and when they search for middle- and lower-level nodes. To alleviate these disadvantages, indexing techniques based on numbering schemes were proposed, and they perform better across a variety of queries. However, an indexing method based on a numbering scheme must assign numbers to all nodes of all XML documents, which increases both the search overhead and the disk space used by the indexes. In this paper, we propose a novel method that efficiently constructs and manages the common paths of all XML documents. The proposed method stores similarly structured XML documents more efficiently. In addition, it supports insertion and deletion of XML documents more flexibly.
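To make the trade-off concrete, the following minimal sketch illustrates a generic interval numbering scheme, not the index proposed in this paper. It assumes a simple depth-first (start, end) labeling; the function names `number_nodes` and `is_ancestor` and the sample document are hypothetical and chosen only for illustration.

```python
import xml.etree.ElementTree as ET


def number_nodes(root):
    """Label every element with a (start, end) interval via depth-first traversal.

    Assumption: a plain counter-based interval scheme, used here only to
    illustrate the general idea of numbering-scheme indexes.
    """
    labels = {}
    counter = 0

    def visit(node):
        nonlocal counter
        counter += 1
        start = counter          # number assigned when the node is entered
        for child in node:
            visit(child)
        counter += 1             # number assigned when the node is left
        labels[node] = (start, counter)

    visit(root)
    return labels


def is_ancestor(labels, ancestor, descendant):
    """True if ancestor's interval strictly contains descendant's interval."""
    a_start, a_end = labels[ancestor]
    d_start, d_end = labels[descendant]
    return a_start < d_start and d_end < a_end


# Hypothetical sample document.
doc = ET.fromstring(
    "<book><chapter><title/><section/></chapter><appendix/></book>"
)
labels = number_nodes(doc)
chapter = doc.find("chapter")
section = doc.find("chapter/section")
appendix = doc.find("appendix")

print(is_ancestor(labels, chapter, section))   # interval containment check
print(is_ancestor(labels, appendix, section))
print(len(labels))                             # one label per element
```

The ancestor-descendant test reduces to two integer comparisons, which is why such schemes handle structural joins well. The cost is that `labels` holds one entry for every element of every document, which corresponds to the index size and search overhead described above.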
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, Y., Park, SH., Kim, TS., Lee, JH., Park, TS. (2006). An Efficient Index Scheme for XML Databases. In: Wiedermann, J., Tel, G., Pokorný, J., Bieliková, M., Štuller, J. (eds) SOFSEM 2006: Theory and Practice of Computer Science. SOFSEM 2006. Lecture Notes in Computer Science, vol 3831. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11611257_35
DOI: https://doi.org/10.1007/11611257_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31198-0
Online ISBN: 978-3-540-32217-7
eBook Packages: Computer Science, Computer Science (R0)