Advertisement

TP+Output: Modeling Complex Output Information in XML Twig Pattern Query

  • Huayu Wu
  • Tok Wang Ling
  • Gillian Dobbie
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6309)

Abstract

Twig pattern is considered a core pattern for XML queries. However, due to the limited expressivity of twig pattern expressions, many queries that aim to find complex output information under one object cannot be expressed in a single twig pattern. Instead, they have to be expressed as XQuery expression, which is transformed into several twig patterns linked by joins. To process such an XQuery query, we need to match multiple twig patterns to the XML document, even though they are all centered on the same object. In this paper we analyze the characteristics of each query node, i.e. the purpose, optionality and occurrence, and define four types of nodes in a twig pattern query to express output information, namely, output node, optional-output node, predicated-output node, and optional-predicated-output node. Then we propose the TP+Output expression to extend twig pattern queries, to model complex output information based on the semantics of different node types. With TP+Output, queries with the four output types can be expressed in one TP+Output expression and processed more efficiently. We extend our previously proposed twig pattern query processing algorithm, VERT, to process the TP+Output query, and demonstrate the performance improvement of using TP+Output to represent queries.

Keywords

Query Processing Output Node Query Node Inverted List Tree Pattern Query 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
  3. 3.
  4. 4.
    Al-Khalifa, S., Jagadish, H.V., Patel, J.M., Wu, Y., Koudas, N., Srivastava, D.: Structural joins: A primitive for efficient XML query pattern matching. In: ICDE, pp. 141–154 (2002)Google Scholar
  5. 5.
    Berglund, A., Chamberlin, D., Fernandez, M.F., Kay, M., Robie, J., Simeon, J.: XML Path Language (XPath) 2.0. W3C Working Draft (2003)Google Scholar
  6. 6.
    Boag, S., Chamberlin, D., Fernandez, M.F., Florescu, D., Robie, J., Simeon, J.: XQuery 1.0: An XML Query. W3C Working Draft (2003)Google Scholar
  7. 7.
    Bruno, N., Koudas, N., Srivastava, D.: Holistic twig joins: Optimal XML pattern matching. In: SIGMOD Conference, pp. 310–321 (2002)Google Scholar
  8. 8.
    Chen, S., Li, H., Tatemura, J., Hsiung, W., Agrawal, D., Candan, K.S.: Twig2Stack: bottom-up processing of generalized-tree-pattern queries over XML documents. In: VLDB, pp. 283–294 (2006)Google Scholar
  9. 9.
    Chen, T., Lu, J., Ling, T.W.: On boosting holism in XML twig pattern matching using structural indexing techniques. In: SIGMOD Conference, pp. 455–466 (2005)Google Scholar
  10. 10.
    Chen, Z., Jagadish, H.V., Lakshmanan, L.V.S., Paparizos, S.: From tree patterns to generalized tree patterns: on efficient evaluation of XQuery. In: VLDB, pp. 237–248 (2003)Google Scholar
  11. 11.
    Florescu, D., Kossmann, D.: Storing and querying XML data using an RDMBS. IEEE Data Eng. Bull., 27–34 (1999)Google Scholar
  12. 12.
    Gou, G., Chirkova, R.: Efficiently querying large XML data repositories: a survey. IEEE Trans. Knowl. Data Eng. 19(10), 1381–1403 (2007)CrossRefGoogle Scholar
  13. 13.
    Jiang, H., Lu, H., Wang, W.: Efficient processing of XML twig queries with OR-predicates. In: SIGMOD Conference, pp. 59–70 (2004)Google Scholar
  14. 14.
    Jiang, H., Wang, W., Lu, H., Yu, J.: Holistic twig joins on indexed XML documents. In: VLDB, pp. 273–284 (2003)Google Scholar
  15. 15.
    Lu, J., Ling, T.W., Chan, C., Chen, T.: From region encoding to extended dewey: On efficient processing of XML twig pattern matching. In: VLDB, pp. 193–204 (2005)Google Scholar
  16. 16.
    Pal, S., Cseri, I., Seeliger, O., Schaller, G., Giakoumakis, L., Zolotov, V.: Indexing XML data stored in a relational database. In: VLDB, pp. 1146–1157 (2004)Google Scholar
  17. 17.
    Shanmugasundaram, J., Tufte, K., He, G., Zhang, C., Dewitt, D., Naughton, J.: Relational databases for querying XML documents: limitations and opportunities. In: VLDB, pp. 302–314 (1999)Google Scholar
  18. 18.
    Theodoratos, D., Dalamagas, T., Koufopoulos, A., Gehani, N.: Semantic querying of tree-structured data sources using partially specified tree patterns. In: CIKM, pp. 712–719 (2005)Google Scholar
  19. 19.
    Wu, H., Ling, T.W., Chen, B.: VERT: a semantic approach for content search and content extraction in XML query processing. In: Parent, C., Schewe, K.-D., Storey, V.C., Thalheim, B. (eds.) ER 2007. LNCS, vol. 4801, pp. 534–549. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  20. 20.
    Yoshikawa, M., Amagasa, T., Shimura, T., Uemura, S.: XRel: a path-based approach to storage and retrieval of XML documents using relational databases. ACM Trans. Interet Technol. 1, 110–141 (2001)CrossRefGoogle Scholar
  21. 21.
    Zhang, C., Naughton, J., Dewitt, D., Luo, Q., Lohman, G.: On supporting containment queries in relational database management systems. In: SIGMOD Conference, pp. 425–436 (2001)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Huayu Wu
    • 1
  • Tok Wang Ling
    • 1
  • Gillian Dobbie
    • 2
  1. 1.School of ComputingNational University of SingaporeSingapore
  2. 2.Department of Computer ScienceThe University of AucklandNew Zealand

Personalised recommendations