Advertisement

Querying Semistructured Data Efficiently

  • Hongsik Rho
  • Wen-Chi Hou
  • Dunren Che
  • Chih-Fang Wang
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2736)

Abstract

In this paper, we address the issue of fast query processing of semistructured data. To this end, we propose a new index scheme, called the HQ-Index. The HQ-Index consists of two indexes, the H-Index and Q-Index. The H-Index is basically a hash table built upon the path expressions. It serves as a path index. The Q-Index facilitates fast traversal to the ancestor nodes in the graph and the retrieval of requested information. Not only the H-Index and Q-Index have very simple index structures, but also the combined use of them can effectively speed up the evaluation of ordinary path queries. The results of our experiments further confirm the advantage of our approach when compared with the Dataguide, one of the most referenced index schemes for XML and semistructured data.

Keywords

Query Processing Hash Table Index Scheme Source Database Author Data 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Chung, C., Min, J., Shim, K.: APEX: An Adaptive Path Index for XML data. In: Proc. of the ACM SIGMOD conference on Management of data (June 2002)Google Scholar
  2. 2.
    Cover, R.: The Cover Pages: XML (2002), http://www.oasis-open.org/cover/xml.html
  3. 3.
    Cooper, B., Sample, N., Franklin, M., Hjaltason, G., Shadmon, M.: A Fast Index for Semistructured Data. In: Proc. of the 27th VLDB Conference, pp. 341–350 (September 2001)Google Scholar
  4. 4.
    Document Object Model (DOM) (2002), http://www.w3.org/DOM
  5. 5.
    Goldman, R., Widom, J.: DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. In: Proc. of the 23rd VLDB Conference, pp. 436–445 (1997)Google Scholar
  6. 6.
    Milo, T., Suciu, D.: Index structures for path expressions. In: Proc. of the International Conference on Database Theory, pp. 277–295 (1999)Google Scholar
  7. 7.
    McHugh, J., Widom, J.: Query Optimization for XML. In: Proc. VLDB Conf., pp. 315–326 (1999)Google Scholar
  8. 8.

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Hongsik Rho
    • 1
  • Wen-Chi Hou
    • 1
  • Dunren Che
    • 1
  • Chih-Fang Wang
    • 1
  1. 1.Dept. of Computer ScienceSouthern Illinois UniversityCarbondaleUSA

Personalised recommendations