Skip to main content

An Efficient Index Scheme for XML Databases

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3831))

Abstract

Finding efficient and useful ways to search and index XML documents is a popular research topic in the field of computer and information science today. The path-based indexing method shows disadvantages of performance degradation when performing join operations of ancestor-descendent relationships and searching for middle and lower level nodes. To alleviate these disadvantages, a numbering scheme based indexing technique was proposed. This technique shows better performance in a variety of queries. However, a numbering scheme based indexing method is necessary to assign numbers to all nodes of all XML documents. It occurs the problem of both search overhead and disk space usage for indexes. In this paper, we propose a novel method that can efficiently construct and manage common paths of all XML documents. The proposed method stores similar structured XML documents more efficiently. In addition, it supports both insertion and deletion of XML documents more flexible.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abiteboul, S., Quass, D., McHugh, J., Widom, J., Wiener, J.: The Lorel Query Language for Semistructured Data. Int. J. on Digital Libraries 1(1), 68–88 (1997)

    Google Scholar 

  2. Boag, S., Chamberlin, D., Fernandez, M., Florescu, D., Robie, J., Simeon, J.: An XML Query Language(XQuery) (2004), http://www.w3.org/TR/xquery/

  3. Bruno, N., Koudas, N., Srivastava, D.: Holistic Twig Joins: Optimal XML Pattern Matching. In: Proc. of the ACM SIGMOD, pp. 310–321 (2002)

    Google Scholar 

  4. Chen, Y., Davidson, S., Zheng, Y.: BLAS: An Efficient XPath Processing System. In: Proc. of the ACM SIGMOD, pp. 47–58 (2004)

    Google Scholar 

  5. Chien, S., Vagena, Z., Zhang, D., Tsotras, V., Zaniolo, C.: Efficient Structural Joins on Indexed XML Documents. In: Proc. of the VLDB, pp. 263–274 (2002)

    Google Scholar 

  6. Chung, C., Min, J., Shim, K.: APEX: An Adaptive Path Index for XML Data. In: Proc. of the ACM SIGMOD, pp. 121–132 (2002)

    Google Scholar 

  7. Clark, J., DeRose, S.: XML Path Language(XPath). Version 1.0 w3c recommendation. Technical Report REC-xpath-1999 1116, World Wide Web Consortium (1999)

    Google Scholar 

  8. Cooper, B., Sample, N., Franklin, M., Hjaltason, G., Shadmon, M.: A Fast Index for Semistructured Data. In: Proc. of the VLDB, pp. 341–350 (2001)

    Google Scholar 

  9. Deutsch, A., Fernandez, M., Florescu, D., Levy, A., Suciu, D.: XML-QL: A Query Language for XML (1998), http://www.w3.org/TR-/1998/Note-XML-QL-19980819/

  10. Dietz, P.: Maintaining Order in a Linked List. In: Proc. of the Fourteenth Annual ACM Symposium on Theory of Computing, pp. 122–127 (1982)

    Google Scholar 

  11. Goldman, R., Widom, J.: DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. In: Proc. of the VLDB, pp. 436–445 (1997)

    Google Scholar 

  12. Harding, P., Li, Q., Moon, B.: XISS/R: XML Indexing and Storage System Using RDBMS. In: Proc. of the VLDB, pp. 1073–1076 (2003)

    Google Scholar 

  13. Knuth, D.: The Art of Computer Programming, Sorting and Searching, 3rd edn., vol. III. Addison Wesley, Reading (1998)

    Google Scholar 

  14. Li, Q., Moon, B.: Indexing and Querying XML Data for Regular Path Expression. In: Proc. of VLDB, pp. 361–370 (2001)

    Google Scholar 

  15. Milo, T., Suciu, D.: Index Structures for Path Expressions. In: Afrati, F.N., Kolaitis, P.G. (eds.) ICDT 1997. LNCS, vol. 1186, pp. 277–295. Springer, Heidelberg (1996)

    Google Scholar 

  16. Robie, J., Lapp, J., Schach, D.: XML Query Language (XQL). (1998), http://www.w3.org/-TrandS/QL/QL98/pp/xql.htm

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kim, Y., Park, SH., Kim, TS., Lee, JH., Park, TS. (2006). An Efficient Index Scheme for XML Databases. In: Wiedermann, J., Tel, G., Pokorný, J., Bieliková, M., Štuller, J. (eds) SOFSEM 2006: Theory and Practice of Computer Science. SOFSEM 2006. Lecture Notes in Computer Science, vol 3831. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11611257_35

Download citation

  • DOI: https://doi.org/10.1007/11611257_35

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31198-0

  • Online ISBN: 978-3-540-32217-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics