A Structure-Based Filtering Method for XML Management Systems

  • Olli Luoma
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3180)


To answer queries, many XML management systems perform structural joins, i.e., they determine all occurences of parent/child or ancestor/descendant relationships between node sets. These joins are often one of the most time-consuming phases in query evaluation, so it is desirable to reduce the size of the node sets before performing the joins. This problem has earlier been approached by using signatures built on the content of the nodes, but in this paper, we propose a novel method in which the nodes are filtered based on the structural properties of their subtrees. To achieve this, we use a schema graph which summarizes the structures of XML documents more accurately than conventional summarization methods.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    World Wide Web Consortium. Extensible Markup Language (XML) 1.0 (2000),
  2. 2.
    Chaudri, A.B., Rashid, A., Zicari, R.: XML Data Management: Native XML and XML-Enabled Database Systems. Addison-Wesley, Reading (2003)Google Scholar
  3. 3.
    World Wide Web Consortium. XML Path Language (XPath) Version 1.0 (2000),
  4. 4.
    Dietz, P.F.: Maintaining order in a linked list. In: Proc. of the 14th ACM Symposium on Theory of Computing, pp. 122–127 (1982)Google Scholar
  5. 5.
    Kha, D.D., Yoshikawa, M., Uemura, S.: An XML indexing structure with relative region coordinate. In: Proc. of the 17th IEEE Intl Conf. on Data Engineering, pp. 212–220 (2001)Google Scholar
  6. 6.
    Lee, Y.K., Yoo, S., Yoon, K., Berra, B.: Index structures for structured documents. In: Proc. of the 1st Intl Conf. on Digital Libraries, pp. 91–99 (1996)Google Scholar
  7. 7.
    Luoma, O.: Indexing XML data with a schema graph. In: Proc. of the IASTED Intl Conf. on Databases and Applications, pp. 274–279 (2004)Google Scholar
  8. 8.
    McHugh, J., Abiteboul, S., Goldman, R., Quass, R., Widom, J.: Lore: A database management system for semistructured data. SIGMOD Record 26(3), 54–66 (1997)CrossRefGoogle Scholar
  9. 9.
    Shin, D., Jang, H., Jin, H.: BUS: An effective indexing and retrieval scheme in structured documents. In: Proc. of the 3rd ACM Intl Conf. on Digital Libraries, pp. 235–243 (1998)Google Scholar
  10. 10.
    Tak, T.W., Annevelink, J.: Integrating a structured-text retrieval system with an object-oriented database system. In: Proc. of the 20th Intl Conf. on Very Large Databases, pp. 740–749 (1994)Google Scholar
  11. 11.
    Kanne, C.C., Moerkotte, G.: Efficient storage of XML data. In: Poster abstract in Proc. of the 16th Intl Conf. on Data Engineering, p. 198 (2000)Google Scholar
  12. 12.
    Chen, Y., Aberer, K.: Combining pat-trees and signature files for query evaluation in document databases. In: Proc. of the 10th Intl Conf. on Database and Expert Systems Applications, pp. 473–484 (1999)Google Scholar
  13. 13.
    Park, S., Kim, H.J.: A new query processing techique for XML based on signature. In: Proc. of the 7th Intl Conf. on Database Systems for Advanced Applications, pp. 22–31 (2001)Google Scholar
  14. 14.
    Yoshikawa, M., Amagasa, T., Shimura, T., Uemura, S.: XRel: A path-based approach to storage and retrieval of XML documents using relational databases. ACM Transactions on Internet Technologies 1(1), 110–141 (2001)CrossRefGoogle Scholar
  15. 15.
    Bosak, J.: The complete plays of Shakespeare marked up in XML,
  16. 16.
    Busse, R., Carey, M., Florescu, D., Kersten, M., Manolescu, I., Schmidt, A., Waas, F.: XMark - an XML benchmark project,
  17. 17.
    Ley, M.: Digital bibliography library project,

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Olli Luoma
    • 1
  1. 1.Department of Information TechnologyUniversity of TurkuFinland

Personalised recommendations